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ABSTRACT 

This  report  is  a  brief  handbook  on  the  comparative  descriptive  statistics  of  a  wide  variety  of  skewed 
probability  distributions,  both  continuous  and  discrete.  The  aim  is  to  facilitate  the  comparison  of  differ¬ 
ent  distributions,  for  use  where  random  variables  are  employed  without  any  firm  information  on  their 
distribution.  In  this  situation,  it  is  of  interest  to  look  for  sensitivity  to  the  distribution  chosen.  This  can 
best  be  done  by  running  the  model  with  a  variety  of  distributions,  which  then  raises  the  question  of 
how  to  compare  distributions.  This  work  advocates  the  use  of  moments  and  presents  the  requisite 
equations.  As  obvious  as  this  approach  may  appear,  many  of  the  equations  do  not  seem  to  have  been 
published  previously  and  some  of  the  results  are  apparently  wholly  new.  A  total  of  18  distributions  are 
treated  in  detail,  including  all  of  the  most  commonly  used  skewed  probability  distributions. 
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Comparative  Descriptive  Statistics  of 
Skewed  Probability  Distributions 


Executive  Summary 

This  Report  addresses  a  technical  issue  in  the  current  practice  of  operational  analysis 
(OA).  Increasingly,  OA  studies  involve  simulations  of  varying  levels  of  sophistication. 
A  feature  of  all  simulations  is  the  use  of  random  variables,  and  this  immediately  raises 
the  question  of  what  distribution  to  employ.  Should  the  random  variable  be  taken  as 
uniformly  distributed  over  some  range,  or  should  it  follow  a  bell  curve  with  some 
mean  and  standard  deviation,  or  be  exponentially  distributed,  and  so  on?  Sometimes 
there  are  methodological  or  theoretical  arguments  favouring  a  particular  distribution, 
but  often  there  are  not.  In  their  absence,  the  modeller  is  forced  to  make  a  more-or-less 
arbitrary  choice  of  distribution,  motivated  perhaps  by  ease  of  use  or  personal  familiar¬ 
ity.  It  then  becomes  interesting  to  know  how  sensitive  the  results  of  the  analysis  are  to 
the  choice  of  distribution:  if  another  had  been  chosen,  would  the  conclusions  have  been 
different? 

The  obvious  way  of  approaching  this  question  is  to  run  the  model  using  a  variety 
of  distributions.  One  must  not,  however,  substitute  one  distribution  for  another  uncri¬ 
tically.  For  example,  usually  the  results  of  a  model  will  be  affected  by  the  mean  of  an 
input  variable,  so  one  must  match  means  when  changing  distributions.  However,  most 
common  distributions  have  two  parameters,  so  the  requirement  of  matching  means  is 
not  in  itself  sufficient.  This  Report  advocates  matching  distribution  moments:  for  two- 
parameter  distributions,  one  should  match  means  and  standard  deviations;  with  three 
parameters,  the  skewness  should  also  be  matched,  and  so  on.  As  obvious  as  this  ap¬ 
proach  may  seem,  the  requisite  equations  have  not  been  encountered  in  any  of  the  texts 
on  probability  distributions  examined  for  this  work. 

Hence,  this  Report  gathers  together  equations  for  distribution  parameters  corres¬ 
ponding  to  a  given  mean,  standard  deviation  and,  where  required,  higher  moments  of 
a  wide  range  of  skewed  probability  distributions.  Most  of  the  equations  were  derived 
during  the  course  of  this  work;  many  appear  not  to  have  been  published  before.  All  of 
the  common  skewed  distributions  are  treated  in  detail,  including 

•  3  one-parameter  distributions 

•  11  two-parameter  distributions 

•  2  three-parameter  distributions 

•  1  four-parameter  distribution 

•  9  continuous  distributions  with  the  range  zero  to  infinity 

•  3  continuous  distributions  with  a  finite  range  {a  to  V) 

•  3  discrete  distributions  with  a  finite  range  (a  to  b). 

In  addition,  11  lesser-known  discrete  distributions  with  finite  ranges  are  mentioned,  to 
give  a  flavour  of  the  diversity  available. 

The  emphasis  on  finite-range  and  semi-infinite  (0  — >  oo)  skewed  distributions 
arose  from  the  original  motivation  for  this  work,  which  was  to  support  the  modelling 
of  decision  times  and  times  to  carry  out  a  given  task.  There  is  no  obvious  interpretation 
for  a  negative  decision  time,  so  it  is  natural  to  use  distributions  that  are  positive  defi¬ 
nite  when  modelling  such  quantities.  However,  the  Gaussian  distribution  (defined 
from  -  oo  to  oo)  is  also  included  because  it  is  so  widely  known  and  used. 
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1.  Introduction 

In  many,  perhaps  most,  studies  using  random  variates,  the  question  of  which  distribu¬ 
tion  to  use  is  not  a  significant  issue:  theoretical  or  methodological  arguments  point  to 
the  right  one  for  the  purpose.  The  issue  of  comparison  between  distributions  does  not 
then  arise.  In  some  cases,  however,  there  is  no  clearly  preferred  probability  distribu¬ 
tion;  the  choice  is  more  or  less  arbitrary  and  this  raises  the  question  of  the  sensitivity  of 
the  analysis  to  that  choice.  In  these  circumstances  it  is  important  to  have  a  rational 
basis  for  comparing  distributions,  so  that  one  is  comparing  like  with  like  as  far  as 
practicable.  The  method  advocated  in  this  report  is  to  match  moments  of  distributions: 
where  distributions  have  a  single  parameter,  this  is  chosen  to  equate  means;  for  two- 
parameter  distributions,  means  and  standard  deviations  are  matched,  and  so  on.  This 
seems  an  entirely  obvious  approach,  yet  it  is  not  mentioned  in  any  of  the  many  refer¬ 
ences  examined  in  the  course  of  this  work.  As  a  consequence,  the  requisite  equations 
are  not  compiled  anywhere,  so  this  work  was  carried  out  to  fill  the  gap.  Some  of  the 
results  presented  herein  are  available  in  texts;  most  are  not.  For  example,  nothing  like 
Figures  2.5  or  3.4-3.6  has  been  encountered  in  the  literature. 

Most  of  the  distributions  considered  in  this  report  have  two  parameters.  Equa¬ 
tions  were  derived  and  are  presented  herein  for  parameter  values  that  give  a  desired 
mean  and  standard  deviation.  The  differences  between  distributions  are  then  quanti¬ 
fied  by  skewness  q3  and  kurtosis  r)4,  which  are  respectively  the  third  and  fourth  mo¬ 
ments  about  the  mean  divided  by  the  third  and  fourth  powers  of  the  standard  devia¬ 
tion.  Expressions  for  skewness  and  kurtosis  in  terms  of  mean  and  standard  deviation 
are  presented  where  possible.  Often,  the  expressions  are  more  or  less  straightforward 
re-arrangements  of  standard  results,  but  in  some  cases  the  equations  have  no  closed- 
form  solution;  for  these,  graphs  and  tables  are  presented  that  assist  in  the  choice  of 
parameters. 

In  addition  to  the  two-parameter  distributions,  three  one-parameter,  two  three- 
parameter  and  one  four-parameter  distributions  are  included.  For  these,  equations  are 
presented  giving  the  parameters  in  terms  of  as  many  moments  as  required. 

The  focus  on  skewed  distributions  arose  from  the  original  motivation  for  this 
work,  which  was  to  support  a  study  on  modelling  decision  making  [1].  The  time  taken 
to  make  a  decision  has  a  natural  definition  for  positive  values  only;  hence,  distributions 
of  decision  time  ought  to  be  strictly  zero  for  t  <  0.  Further,  it  has  been  argued  that  the 
likelihood  of  an  instantaneous  decision  is  negligible,  so  the  distribution  of  decision 
times  ought  to  be  zero  at  t  =  0  as  well.  Most  distributions  treated  herein  have  these 
characteristics;  the  two  exceptions  (Gaussian  and  exponential)  are  included  because 
they  are  so  widely  used.  Recently,  the  results  reported  herein  have  been  applied  to  a 
study  of  service  times  in  a  queueing-theory  model  of  maritime  interception  [2].  This  is 
another  case  where  one  requires  the  distribution  to  be  zero  for  t  <  0. 
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This  work  was  prompted  by  studies  from  the  Naval  Undersea  Warfare  Center 
Division  Newport  advocating  the  use  of  the  inverse  Gaussian  probability  density  as  a 
representation  for  decision  times  [3,4]-  This  distribution  is  not  widely  known  and  is 
omitted  from  several  well  known  texts  (e,g,  [5-8])  although  it  was  first  described  over 
half  a  century  ago  [9]  and  its  properties  have  been  documented  for  several  decades  [10 
(ch.l5),ll],  In  §2  of  this  report,  its  properties  are  compared  with  those  of  many  other 
probability-density  distributions.  The  emphasis  naturally  falls  on  other  distributions 
on  [0,co),  although  the  Gaussian  distribution  is  also  included  because  it  is  so  ubiquitous 
and  to  provide  a  point  of  comparison  with  a  non-skewed  distribution. 

Tire  property  that  makes  the  inverse  Gaussian  distribution  particularly  suited  to 
the  description  of  decision  speed,  the  behaviour  of  the  'decision  rate'  or  'hazard  rate'  at 
long  times,  is  highlighted  in  §2,3.4. 

As  well  as  speed,  decisions  are  characterised  by  soundness  [1].  Although  perhaps 
not  essential,  it  is  common  for  soundness  scales  to  be  finite:  0-1  say,  or  0-10.  The  mo¬ 
delling  of  soundness  is  just  as  likely  to  be  probabilistic  as  that  of  decision  speed,  so 
finite-range  distributions  are  described  in  §3.  Often,  soundness  is  ranked  on  a  discrete 
scale,  0-3, 1-5  etc.,  the  so-called  'Likert  scales'.  This  is  also  amenable  to  a  probabilistic 
treatment;  appropriate  probability  distributions  are  described  in  §3.3. 

This  work  serves  as  a  brief  handbook  on  the  comparative  descriptive  statistics  of 
a  wide  variety  of  skewed  probability  distributions,  both  continuous  and  discrete.  The 
aim  is  to  facilitate  the  comparison  of  different  distributions. 


2.  Semi-Infinite  Distributions 

As  mentioned  in  §1,  the  inverse  Gaussian  probability  density  has  been  advocated  for 
representing  decision  speed.  The  basic  properties  expected  of  a  distribution  of  decision 
speeds  are  as  follows: 

•  The  domain  of  the  distribution  must  not  include  negative  times  t. 

•  The  probability  density  should  be  zero  at  t  =  0;  for  otherwise  there  is  a  non-zero 
probability  of  an  instantaneous  decision.  In  particular,  the  exponential  distribution, 
which  has  maximum  probability  density  at  t  =  0,  is  not  suitable. 

•  Whether  the  domain  of  the  probability  density  should  extend  to  infinity  is  not  clear, 
but  there  are  as  yet  no  data  establishing  an  upper  limit  on  decision  time.  Thus,  the 
domain  of  the  distribution  should  be  0  <  t  <  co,  so  as  to  avoid  introducing  an  arbi¬ 
trary  parameter. 

The  inverse  Gaussian  probability  distribution,  among  others,  fulfils  these  criteria.  This 
section  presents  the  main  properties  of  this  distribution  and  compares  it  with  other 
distributions  also  satisfying  the  properties  listed  above.  The  feature  distinguishing  the 
inverse  Gaussian  probability  density  from  the  rest  —  the  behaviour  of  its  decision  rate— 
is  addressed  in  §2.3.4. 


2,1  The  Inverse  Gaussian  Distribution 

In  standard  form,  the  definition  of  the  inverse  Gaussian  probability  density  is  [10-12] 


/iG(fK 


1  x 

X(t  —  g)2 

Lfi exp 

2p2f 

(2.1) 
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where  the  parameters  X,  p  must  both  be  greater  than  zero.  As  noted  above,  the  domain 
is  0  <  t  <  oo.  By  direct  calculation,  it  is  found  that  the  mean  of  the  distribution  is  p  and 
its  variance  a2  equals  p3/ X.  Since  the  function  has  two  parameters,  we  re-write  it  using 
the  mean  p  and  standard  deviation  a  as  parameters.  This  is  straightforward  and  has 
the  added  advantage  that  a  little  rearrangement  simplifies  the  argument  of  the  expo¬ 
nential  function: 


fiG  (t)  = 


exp 


(2.2) 


where  v  =  p/a. 

Figure  2.1  shows  plots  of  the  distribution  for  three  values  of  a/p.  As  Equation 
(2.2)  indicates,  the  argument  of  the  distribution  scales  with  p,  so  it  is  useful  to  plot 
Figure  2.1  with  t / p  on  the  abscissa.  Scaling  is  made  complete  by  plotting  the  product 
p/iG(0  and  labelling  the  curves  with  the  'coefficient  of  variation' s  =  a/p  =  1/v. 

As  may  be  expected,  and  as  Figure  2.1  shows,  the  inverse  Gaussian  distribution 
becomes  more  symmetric  as  a/p  0  and  more  skewed  asa/p^  oo.  The  mode  tm  (for¬ 
mula  in  Table  2.1  below)  lies  between  zero  and  the  mean  p  for  all  values  of  a/p,  with 
tm  -»  0  as  a/p  -»  oo.  The  probability  density /iG(fm)  at  the  mode  rises  as  either  a/p  ->  0 
or  a/p  ->  oo.  The  minimum  value  of  p/iG(fm)  occurs  when  a/p  =  l/fl  «  0.7071.  (This 
does  not  depend  upon  p.) 

As  mentioned  in  §1,  the  method  of  comparing  the  various  distributions  is  to  ex¬ 
amine  functions  with  the  same  values  of  p  and  a;  differences  are  then  be  expressed  in 
terms  of  the  higher-moment  ratios,  skewness  ti3  and  kurtosis  r|4.  Of  course,  yet  higher 
moments  exist,  but  ri3  and  r|4  are  usually  sufficient  to  display  trends.  For  the  inverse 
Gaussian  distribution,  direct  evaluation  of  the  integrals  concerned  gives  simple  expres¬ 
sions  for  q3iG  and  r)4iG  in  terms  of  p  and  a: 


h3iG  =3a/p/X  =  3a/p 

Tl4iG  =3  +  15p/A,  =  3  +  15(a/p)2. 


(2.3) 


Figure  2.1:  Examples  of  the  inverse  Gaussian  distribution.  Curves  are  labelled  by  values  of  the 
coefficient  of  variation  a/p. 
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2.2  Other  Distributions 

This  section  discusses  the  properties  of  the  Gaussian  distribution  and  eight  semi-infi¬ 
nite  distributions:  exponential,  log-normal,  gamma,  bi-exponential,  Weibull,  log-logis¬ 
tic,  inverted  gamma  and  beta-prime.  The  list  comprises  all  the  best  known  distributions 
on  [0,oo),  including  all  those  listed  in  a  widely  used  text  on  simulation  [7(pp.  299-318)]. 
The  Gaussian  and  exponential  distributions  are  not  realistic  candidates  for  describing 
decision  speed,  but  are  included  because  they  are  so  common.  The  Gaussian  also  pro¬ 
vides  a  point  of  reference  to  a  non-skewed  distribution. 

In  the  following  Section,  the  distributions  are  compared  using  the  mean  p  and 
standard  deviation  a  as  the  parameters,  as  in  Equation  (2.2).  Hence,  the  aim  of  this 
Section  is  to  invert  the  expressions  for  p  and  a  in  terms  of  the  standard  parameters. 
This  can  be  done  in  closed  form  in  all  but  two  cases  (Weibull  and  log-logistic).  For 
these  two,  graphs  and  tables  are  presented  to  facilitate  the  transformation.  Details  of 
the  inversion  are  presented  in  the  following  subsections.  For  convenience,  expressions 
for  the  main  properties  of  the  distributions  are  collected  in  Table  2.1  (pp.  6-7). 


2.2.1  Gaussian  Distribution 


The  standard  form  of  the  Gaussian  distribution  already  uses  the  mean  p  and  standard 
deviation  a  as  parameters: 


fc<f) 


-(t-n)2 

2cr2 


(2.4) 


2.2.2  Exponential  Distribution 

This  distribution  has  just  one  parameter,  the  mean  p,  which  must  be  positive.  The  stan¬ 
dard  form  is 


f— ) 

In, 


(0  <  t  <  co). 


(2.5) 


The  standard  deviation  of  this  distribution  equals  p  also. 


2.2.3  Log-Normal  Distribution 

The  standard  form  of  the  log-normal  distribution  is  often  written  using  the  symbols  p 
and  a  to  represent  the  mean  and  standard  deviation  of  a  notional  underlying  Gaussian 
distribution  (e.g.  [6,7,12,13]),  rather  than  of  the  log-normal  distribution  itself.  This  con¬ 
fusing  notation  can  be  simply  avoided  by  adopting  other  symbols.  For  example,  Olkin 
et  al.  [5]  write 


/l-n(f)  = 


-exp 


-(In  t-%Y 


(0<t<oo) 


where  6,  6,  are  the  parameters,  both  of  which  must  be  positive.  The  mean  p  and  stand¬ 
ard  deviation  a  of  the  log-normal  distribution  are  given  by 


p  =  exp^  +  S2/2)/  cj  =  exp(^  +  82/2)-^exp(s2)-l .  (2.7) 

These  equations  can  be  inverted,  leading  to  the  following  expression  for  the  distribu¬ 
tion  in  terms  of  p,  a: 
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1 

iV2«ln(S2+l)eXP 


ln2(fVs2  + 1  /(x) 

21n(s2  +  l) 


where  s  =  a/p,  the  inverse  of  v. 


(2.8) 


2.2.4  Gamma  Distribution 


In  standard  form,  the  gamma  distribution  reads  [5] 


(0  <  t  <  oo). 


(2.9) 


where  the  parameters  r,  0  must  both  be  positive.  F(r)  is  the  usual  gamma  function  [8 
(§6.1.1)].  The  equations  for  the  mean  p  and  standard  deviation  a  in  terms  of  the  para¬ 
meters, 

P  =  r/0,  c2  =  r/02,  (2.10) 

are  easily  inverted,  so  the  distribution  can  be  rewritten  using  these  as  parameters: 


/r(0- 


(2.11) 


where  v  =  p/a,  as  for  the  inverse  Gaussian  distribution. 

To  obtain /r(0)  =  0,  as  required,  one  must  take  r>  1.  This  implies  p/a  >  1.  That  is, 
it  is  not  possible  to  have  the  standard  deviation  larger  than  the  mean  and  still  retain 
/j’(O)  =  0-  This  is  in  contrast  to  the  behaviour  of  the  inverse  Gaussian  and  log-normal 
distributions. 


2.2.5  Bi-exponential  Distribution 

As  noted  above,  the  exponential  distribution  is  not  suitable  for  representing  decision 
speed  because  its  maximum  value  lies  at  t  =  0.  This  defect  can  be  rectified  by  a  simple 
generalisation: 


/b-e(*): 


aP  A-af 


P~a 


)  (0  <  t  <  oo). 


(2.12) 


where  the  two  parameters  a,  p  must  be  non-equal  and  both  positive,  and  one  can  take 
P  >  a  without  loss  of  generality.  This  is  known  as  the  bi-exponential  distribution,  also 
as  the  two-stage  hypo-exponential  or  generalised  Erlang  distribution  [14(p.448)].  Al¬ 
though  this  distribution  has  a  simple  form,  expressions  for  the  mean  and  standard 
deviation  in  terms  of  the  parameters  are  rather  more  complicated  than  for  the  distribu¬ 
tions  treated  so  far: 


a  +  p 

ap  ' 


\/a2  +p2 

ap 


(2.13) 


Equations  (2.13)  can  be  inverted: 


a  = 


2A* 

1  +  a/ 2s2  -1  7 


P  =  - 


2/p 


-V  2s2-: 


(2.14) 


(s  =  a/p,  as  before),  but  is  not  expedient  to  substitute  these  equations  into  Equation 
(2.12).  However,  relatively  simple  expressions  can  be  obtained  for  the  mode,  skewness 
and  kurtosis  in  terms  of  s,  as  shown  in  Table  2.1. 
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Table  2.1  continued  (see  previous  page  for  footnotes). 


Log-logistic 

Inverted  gamma 

Beta  prime  A 

Mode  tm 

p-F1 

Va  +  1 

s2+l 

H3s2+1 

x^1 
q  +  1 

Skewness 

Eqn  (2.24) 

4s 

1-s2 

(a  parameter) 

Kurtosis  r\4 

Eqn  (2.25) 

3(l  + 7s2) 
(l-s2)(l-2s2) 

Eqn  (2.37) 

Behaviour 
atf  =  0 

/( 0)  and  its  first 
[a  -  1J  deriva¬ 
tives  are  zerot 

/( 0)  and  all  its  deriva¬ 
tives  are  zero 

/( 0)  and  its  first 
[p  -  1J  deriva¬ 
tives  are  zerot 

Decision 
ratet  h(t) 

ap2a  t “-1 

[Pv2+1  )f*2e-^Vl  1 

T^P"1  T 

(fa+paf 

f  2  0  V2  +  ll 
+2,p — - — J 

(f  ^ 

h(oo)i 

0 

0 

00 

Equations  (2.14)  imply  that  l/V2  <  cr/p  <  1,  so  that  a  and  p  are  real  and  positive. 
Hence,  as  with  the  gamma  distribution,  the  range  of  standard  deviations  that  can  be 
obtained  for  a  given  mean  is  restricted;  the  range  available  with  the  bi-exponential 
distribution  is  particularly  small,  as  illustrated  in  Figure  2.5  below  (p.  14). 


2.2.6  Weibull  Distribution 

The  standard  form  of  the  Weibull  distribution  is  [5] 


or 


V(f/a)P  (0  <  t  <  oo) 


(2.15) 


where  the  parameters  a,  p  are  both  positive  and  P  >  1  is  required  to  obtain /w(0)  =  0.(a) 
As  with  other  distributions,  expressions  for  the  mean  and  standard  deviation  in  terms 
of  the  parameters  can  be  obtained,  but  in  this  case  they  cannot  be  inverted  in  closed 
form.  The  expressions  are: 


l'' 

p  =  aT|  1  +  — 

'  P 


a  =  a 


1  +  ^ 


1+- 

.  Py 


(2.16) 


Since  we  consider  only  p  >  1  and  since  T(x)  <  1  in  the  range  1  <  x  <  2,  with  a  minimum 
at  r(1.46163)  =  0.885603  [8(p.259)],  it  follows  that  0.8856  a  <  p  <  a.  That  is,  the  mean  lies 
within  12%  of  the  value  of  a. 

It  is  clear  from  Equations  (2.16)  that  the  ratio  s  =  a/p  depends  on  p  alone.  Using 
the  recurrence  and  duplication  formulae  for  the  gamma  function  [8(p.256)],  one  obtains 


s2=  £ 


22/Pp 

flAl 

U  pj 

Vur| 

T+i) 

V  P ) 

-1. 


(2.17) 


(a)  Note  that  a  and  p  are  often  interchanged,  e.g.  [6,7]. 
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From  this  and  the  properties  of  T(x),  one  sees  that  s  — >  0  as  p  — >  oo  and  s  — »  1  as  p  -»  1. 
That  is,  as  with  the  gamma  and  bi-exponential  distributions,  it  is  not  possible  to  obtain 
a  >  p  while  retaining  the  desired  behaviour  near  t  =  0, 

Expressions  similar  to  Equations  (2,16)  for  skewness  ti3  and  kurtosis  ri4  read: 


h3 


1  +  -  -3H  1  + 


1  +  -  +2  T°\  1  +  - 


r  i+- 


i+- 


3/2 


(2.18) 


1  +  - 


-4T  1  + 


p4 


1  +  -  +  6T  1  +  - 


r2fi+-l-3r4fi  +  1 


r  i+-  -r 


i+- 


(2.19) 


There  seems  little  useful  that  can  be  done  to  reduce  these  equations  further.  It  can  be 
seen  from  these  equations  and  those  in  Table  2.1  that  a  acts  purely  as  a  scaling  para¬ 
meter. 

Figure  2.2  shows  plots  of  properties  of  the  Weibull  distribution  as  a  function  of 
the  shape  parameter  p,  The  practical  value  of  this  figure  lies  in  its  facilitation  of  the 
inversion  of  Equations  (2.16)  to  obtain  p  for  given  values  of  a/ p,  a  necessary  step  in  the 
comparison  process.  Figure  2.2  shows  that  1/p  «  a/p,  but  that  a/ p  slightly  exceeds  1/p 
except  at  zero  and  unity.  Table  2.2  lists  values  of  1/P  to  six  decimal  places,  obtained  by 
numerical  inversion  of  Equation  (2.17),  for  selected  values  of  a/  p. 


P 

50  10  5  4  3  2  1 


Figure  2.2;  Various  properties  of  the  Weibull  distribution  as  a  function  of  the  parameter  p  for 
the  case  p  >  1.  The  curves  shozv  the  ratios  t„/a,  p/a  and  a/p,  where  tm  is  the  mode ,  p  the  mean 
and  a  the  standard  deviation,  and  values  of  the  skeivness  pj  and  kurtosis  r\4.  The  skewness  and 
kurtosis  have  been  scaled  by  their  respective  values  atct  =  1  to  give  convenient  ranges  of  values 
for  plotting. 
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Table  2.2;  Values  of  1/ ft  to  6  decimal  places  for  various  values  of  a/p. 


a/p 

1/P 

a/p 

1/P 

a/p 

1/P 

a/p 

1/p 

0.05 

0.040081 

0.30 

0.269268 

0.50 

0.475885 

0.75 

0.742087 

0.10 

0.082281 

1/3 

0.302707 

0.55 

0.529030 

0.80 

0.794755 

0.15 

0.126471 

0.35 

0.319612 

0.60 

0.582383 

0.85 

0.846958 

0.20 

0.172491 

0.40 

0.370972 

0.65 

0.635775 

0.90 

0.898612 

0.25 

0.220157 

0.45 

0.423133 

0.70 

0.689055 

0.95 

0.949645 

2.2.7  Log-Logistic  Distribution 

The  logistic  distribution  has  a  sech2  t  form,  which  has  obvious  similarities  with  the 
Gaussian  distribution.  The  analogy  suggests  the  definition  of  the  log-logistic  distribu¬ 
tion:  it  has  the  same  relationship  to  the  logistic  distribution  as  the  log-normal  has  to  the 
Gaussian  [10(§23.11)].  The  standard  form  of  the  density  function  is  [7] 

fl-l  ( t )  =  aPa  — — - (0  <  t  <  oo),  (2.20) 

(fa+pa) 


where,  for  the  general  distribution,  the  parameters  a,  p  are  both  positive.  However,  as 
with  several  other  distributions,  the  requirement  that/H(0)  =  0  imposes  an  additional 
restriction  on  one  of  the  parameters.  In  this  case,  one  must  have  a  >  1. 

Yet  further  restrictions  on  a  are  placed  by  requirements  that  the  moments  of  the 
distribution  be  finite.  The  rth  moment  pr  about  the  origin  is  given  by 


Mr  =  Pr 


o  (1+y)2 


d y, 


(2.21) 


an  integral  that  is  infinite  unless  a  >  r.  Hence,  a  finite  standard  deviation  requires  a  > 
2,  a  finite  skewness  a  >  3,  and  so  on.  Since  we  plan  to  compare  log-logistic  with  other 
distributions  by  specifying  values  of  mean  and  standard  deviation,  it  follows  that  we 
are  interested  only  in  those  distributions  with  a  >  2. 

Equation  (2.21)  is  a  standard  integral  [15(^856.07)];  its  evaluation  for  r  =  1, 2  leads 
to  the  mean  and  variance: 


7tP  77 

p  =  —  cosec— , 
a  a 


TCp^ 


a  L 


~  27177  O  77 

2  cosec - cosec  — 


a  a 


a 


(2.22) 


As  with  the  Weibull  distribution,  these  equations  cannot  be  inverted  in  closed  form. 
The  value  of  a  is  the  solution  to  the  equation 


s 


(2.23) 


this  is  shown  in  Figure  2.3  as  the  curve  labelled  'a/p'.  Values  of  a  for  selected  values  of 
a/  p  are  listed  in  Table  2.3. 

Expressions  for  the  skewness  and  kurtosis  are  obtained  from  Equation  (2.21)  for 
the  appropriate  values  of  r.  Simplified  as  far  as  seems  useful,  the  results  are 


*131-1 


3(~2 


s  + 


if 


3s2 +1 


s3  [4  -sec2  (77/a)] 


(2.24) 
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Figure  2.3:  Various  properties  of  the  log-logistic  distribution  as  a  function  of  the  parameter  a 
for  the  case  a  >  2,  The  curves  show  the  ratios  tm/$,  jj,/P  and  a/p,  where  tm  is  the  mode,  p  the 
mean  and  a  the  standard  deviation,  and  values  of  the  skewness  r\3  and  kurtosis  r\4. 


Table  2.3:  Values  of  a  to  six  significant  figures  for  various  values  of  a/p. 


a/p 

a 

a/p 

a 

a/p 

a 

a/p 

a 

cr/p 

a 

0.05 

36.3304 

0.30 

6.36448 

0.70 

3.26802 

1.10 

2.58774 

3.0 

2.08824 

0.10 

18.2465 

1/3 

5.79329 

0.75 

3.13310 

1.20 

2.50287 

4.0 

2.05007 

0.15 

12.2542 

0.40 

4.95149 

0.80 

3.01822 

1.40 

2.37947 

5.0 

2.03218 

0.20 

9.28422 

0.50 

4.13744 

0.90 

2.83440 

1.60 

2.29595 

6.0 

2.02240 

0.25 

7.52257 

0.60 

3,61953 

1.00 

2.69535 

2.00 

2.19380 

10.0 

2.00809 

(s2  +  l)3 

12(s2+l)2  6s2 +3 

4- 

n 

11411  s4  2-sec2(7t/a)]  s4  _4-sec2(7t/a)_  s4 


These  are  also  plotted  in  Figure  2.3.  As  this  Figure  shows,  the  higher  moments  of  the 
log-logistic  distribution  are  rather  large  compared  with  distributions  considered  so  far. 
Equations  (2,23)-(2.25)  also  show  that  the  parameter  (3  is  simply  a  scaling  parameter. 

Figure  2.3  indicates  that  the  full  range  of  values  of  a/  p  are  accessible  only  if  infi¬ 
nite  higher  moments  are  tolerated.  If  one  wishes  to  consider  only  distributions  with 
finite  skewness  or  kurtosis,  then  this  introduces  an  upper  limit  on  a/  p:  a/ p<  0.80869 
for  finite  skewness  and  a/ p  <  0.52272  for  finite  kurtosis. 


2.2.8  Inverted  Gamma  Distribution 

The  inverted  gamma  distribution  is  the  distribution  of  a  random  variate  whose  recipro¬ 
cal  is  gamma-distributed.  The  standard  form  of  the  distribution  is  [12(§18.4)] 


fi -r(t) 


(0  <  t  <  oo). 


(2.26) 
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where  the  parameters  X,  c  must  both  be  positive.  Law  and  Kelton  refer  to  this  distribu¬ 
tion  as  'the'  Pearson  type  V  distribution  [7],  but  it  is  actually  just  one  example  of  this 
class  of  distributions  [10(§12.4),16(ch.4)]. 

For  this  distribution,  the  requirement /j.r(0)  =  0  imposes  no  additional  constraint 
on  the  parameters;  however  constraints  arise  when  moments  are  calculated:  for  the  rth 
moment  to  be  finite,  one  requires  c>r,  as  with  the  parameter  a  of  the  log-logistic  dis¬ 
tribution.  Expressions  are  much  simpler  in  the  case  of  the  inverted  gamma  distribution, 
so  that  closed-form  expressions  can  be  obtained  with  p,  cr  as  parameters.  The  transfor¬ 
mation  is 


X  =  p(v2  +  1),  c  =  v2  +  2,  (2.27) 

where  v  =  p/a,  the  reciprocal  of  the  coefficient  of  variation.  The  resulting  expressions 
for  mode,  skewness  and  kurtosis  are  listed  in  Table  2.1  (p.  7).  The  probability  density  is 


fi-r(t)  = 


[p( 


v2+l)] 


:V2+2 


r(v2  + 


2  )f 


w+3 


exp 


-p- 


v2_  +  l' 
t  ; 


(2.28) 


As  with  the  log-logistic  distribution,  the  whole  range  of  values  of  a/  p  is  access¬ 
ible  provided  that  infinite  higher  moments  are  tolerated.  If  one  wishes  to  consider  only 
distributions  with  finite  higher  moments,  then  one  requires  cr/p  <  1  to  keep  the  skew¬ 
ness  finite  and  cr/p  <  1/V2  to  keep  the  kurtosis  finite. 


2.2.9  Beta-Prime  or  Generalised  F  Distribution 

The  P'  distribution  is  derived  from  the  P  distribution  (§3.2  below)  by  a  transformation 
of  the  argument.  It  is  the  only  three-parameter  distribution  on  [0,oo)  considered  in  this 
report.  The  standard  form  is  [7,10(§25.7,§27.8.1)](b) 


x<?  fP-1 

f^t)=B(p,q)  (f  +  x)*** 


(0  <  t  <  oo). 


(2.29) 


where  all  three  parameters  p,  q,  x  must  be  positive  and  B(p,q)  is  the  standard  beta  func¬ 
tion  [8(§6.2)]: 


B(p,q) 


mm 

T{p  +  q)  ' 


(2.30) 


The  requirement /p-(0)  =  0  imposes  the  constraint  p  >1;  further  constraints  arise  from 
the  moments:  a  finite  rth  moment  is  obtained  only  if  q  >  r.  Equation  (2.29)  is  a  gener¬ 
alisation  of  the  much  studied  F  distribution  of  variance  ratios  [10(ch.27)].(c) 

With  three  free  parameters,  one  can  specify  the  skewness  as  well  as  mean  and 
standard  deviation.  An  expression  for  the  rth  moment  pr  about  the  origin  can  be  readi¬ 
ly  derived: 


u  xr(p  +  r-l)'-(<l-r- 1)1 

(p-l)\(q-l)\ 


(2.31) 


Working  from  this,  the  formulae  for  the  three  moments  in  terms  of  the  parameters  are: 


09  As  with  the  inverted  gamma  distribution.  Law  and  Kelton  [7]  misname  this  distribution, 
calling  it  'the'  Pearson  type  VI  distribution,  whereas  it  is  but  one  example  of  the  class  [10 
(§12.4,§27.7),16(ch.4)].  Johnson  et  al.  [10(p.345)j  present  equations  for  the  inversion  of  a  4- 
parameter  generalisation  of  this  distribution. 

(c)  The  F  distribution  is  obtained  by  setting  t  =  q/p,  i.e.  it  has  two  parameters  only. 
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.  W  2  ^2p(p  +  g-l) 

LI  ,  O  ry  f 

<7-1  (?-l)2(?-2) 


(2.32) 


h3  = 


22p+q- if 
sp(q-3)' 


(2.33) 


where  s  =  o/p,  as  before.  Equation  (2.33)  is  independent  of  the  parameter  x,  and  so  is 
identical  to  the  expression  for  the  skewness  of  the  F  distribution,  since  the  F  distribu¬ 
tion  differs  from  the  (3'  distribution  only  in  its  x  values.  Equations  (2.32)  yield  a  simple 
result  for  s: 


,2.  P  +  (J- 1 
P0?-2)' 


(2.34) 


Recalling  that  this  expression  assumes  q  >  2  and  that  we  are  also  taking  p  >1,  we  see 
that  Equation  (2.34)  implies  0  <  a/\i  <  co;  that  is,  the  full  range  of  s  values  is  available. 
However,  the  inclusion  of  Equation  (2.33)  implies  a  further  restriction  of  the  range  of  q 
toq>3,  which  then  restricts  the  range  of  a/p  to  0  <  a/p  <  V3.  The  same  argument  ap¬ 
plied  to  Equation  (2.33)  shows  that  r|3  must  be  positive;  that  is,  negative  skewness  is 
not  accessible  for  this  distribution. 

Equations  (2.32)  and  (2.33)  are  sufficiently  simple  for  the  inversion  to  be  relative¬ 
ly  straightforward; 


1  +  sr|3  —  s 
3Tl3-Sq3+4s2' 


<7 


=  3  +  2 


s2  +  1 
ST13-2S2' 


(2.35) 


with  x  being  calculable  from  the  expression  for  p  once  p  and  q  are  known.  The  require¬ 
ments  p>l,q>3  put  restrictions  on  values  of  t|3  that  may  be  validly  chosen: 


f 


q3  >  max 


2s, 


6s2  -2^ 
3  s-s3 


This  analysis  leaves  the  kurtosis  iq4  as  the  first  moment  available  for  comparison 
with  other  distributions.  Derivation  shows  that  the  expression  for  r\^  is  also  indepen¬ 
dent  of  the  parameter  x  and  so  is  the  same  expression  as  for  the  F  distribution;  that  is 


t]4  —3  +  6 


5q- 11 
(,-3)(,-4) 


,c  fe-TVz) 

p(p+q-l)(q-3)(q-A) 


(2.37) 


where  it  is  assumed  that  q  >  4.  (The  kurtosis  is  infinite  for  q  <  4.)  This  further  restriction 
on  q  carries  with  it  the  implications 

0<s<W2,  r|3  < 4s  +  2/s.  (2.38) 


2.3  Comparisons  and  Comments 

2.3.1  General  Distribution  Properties 

Table  2.1  (pp.  6-7)  comprises  a  comparison  of  sorts  between  the  distributions.  This 
Section  makes  a  more  graphic  comparison  through  plots  of  the  distributions  and  their 
properties.  From  these  plots  and  Table  2.1,  several  specific  points  of  comparison  are 
noted. 

Figure  2.4  shows  examples  of  distributions  with  the  same  mean  and  standard 
deviation.  Comparison  between  the  distributions  is  made  through  three  properties;  the 
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0  1.0  2.0 


—  inverse  Gaussian 
log-normal 

—  gamma 

••  bi-exponential 

-  Weibull 
log-logistic 
inverted  gamma 

-  beta  prime 
Gaussian 


Figure  2.4:  Examples  of  nine  continuous  distributions  xoith  mean  p  of  1.0  and  standard  devia¬ 
tion  a  of  (a)  0.50,  (b)  0.90  and  (c)  1.50.  The  bi-exponential  distribution  is  omitted  from  panel 
(a)  because  it  cannot  have  this  combination  of  mean  and  standard  deviation  (§2.2.5);  several 
distributions  are  omitted  from  panel  (c)  for  the  same  reason.  The  skeiuness  values  chosen  for  the 
(3  'distribution  are  (a,b)  2.0,  (c)  12.0. 


ratio  tm/  p  of  mode  to  mean,  the  skewness  q3  and  the  kurtosis  r|4;  values  of  these  quan¬ 
tities  are  listed  in  Table  2.4.  In  view  of  their  importance.  Figure  2.5  compares  these  pro¬ 
perties  in  a  consistent  manner  by  plotting  their  values  as  a  function  of  a/p.(d)  The  P' 
distribution  is  different  from  the  others  in  having  a  third  parameter,  which  is  taken  in 
§2.2.9  as  being  given  by  the  skewness.  The  range  of  valid  skewness  values  and  corres¬ 
ponding  ranges  of  tm /  p  and  r|4  are  shown  as  shaded  areas  in  Figure  2.5. 

Figure  2.4  indicates  the  extent  to  which  the  distributions  resemble  one  another, 
particularly  at  low  a/p.  As  Figure  2.5  shows,  the  properties  of  all  except  the  Weibull, 
log-logistic  and  p'  distributions  approach  those  of  the  Gaussian  as  a/p  ->  0.  However, 
the  following  points  of  contrast  and  difference  are  noted: 

•  The  inverse  Gaussian,  log-normal  and  log-logistic  distributions  have  quite  similar 
behaviour  near  t  =  0  (Fig.  2.4).  Compared  with  these,  the  inverted  gamma  distribu¬ 
tion  approaches  zero  more  quickly  and  all  the  rest  less  quickly  as  f  — >  0. 

(d)  For  the  Weibull  and  log-logistic  distributions.  Equations  (2.16)  and  (2.23)  cannot  be  inverted 
to  obtain  closed-form  equations  for  the  parameters  as  a  function  of  a/p,  but  it  is  clearly  pos¬ 
sible  to  construct  plots  of  the  distribution  properties  against  a/p  from  the  data  in  Figures  2.2 
and  2.3. 
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Table  2,4:  Values  of  the  ratio  tm/p  of  mean  to  mode,  skewness  r\3  and  kurtosis  t\ 4for  the  distri¬ 
butions  shown  in  Figure  2,4,* 


ct/(j  =  0.5  ct/  p  =  0.9  a/  p  =  1.5 


inverse  Gaussian 

0.693 

1.50 

6.75 

0.359 

2.70 

log-normal 

0.716 

1.63 

8.04 

0.411 

3.43 

gamma 

0.750 

1.00 

4.50 

0.190 

1.80 

bi-exponential 

- 

- 

- 

0.257 

1.96 

Weibull 

0.830 

0.57 

3.13 

0.121 

1.70 

log-logistic 

0.805 

3.86 

244.0 

0.623 

00 

inverted  gamma 

0.714 

2.67 

22.0 

0.528 

18.9 

0.145 

4.50 

0.171 

7.87 

A  dash  indicates  that  the  value  of  a/  p  is  not  valid  for  that  distribution. 
Chosen  parameter  value. 


(a)  mode  relative 
to  mean 


inverse  Gaussian 
log-normal 
gamma 
bi-exponential 


log-logistic 
inverted  aamma 


>a 
xf 

beta  prime 


1 1 


A”' 


(b)  skewness 


c)  kurtosis 


Figure  2.5:  Comparison  of  several  properties  of  tlie  continuous  distributions:  (a)  ratio  tn/p  of 
mode  to  mean,  (b)  skewness  r|3  and  (c)  kurtosis  r\4  are  shoivn  as  a  function  of  a/p.  Note  that  the 
gamma  and  Weibull  distributions  are  limited  to0<  a/p  <  1,  and  the  bi-exponential  distribution 
to  l/V2  <  a/p  <  1.  The  exponential  distribution  has  a/p  =  1  only.  The  P'  distribution  has  three 
parameters,  and  so  its  properties  have  a  range  of  values  for  any  given  value  of  a/p.  Skeumess  is 
taken  as  the  third  parameter;  the  shaded  region  in  panel  (b)  shows  the  range  ofr\3  values  satisfy¬ 
ing  Equations  (2.36)  and  (2.38).  The  shaded  areas  in  the  other  panels  indicate  the  corresponding 
ranges  of  values  of  t„/p  and  although  this  representation  does  not  shoio  the  detailed  corres¬ 
pondence  between  points  in  these  ranges. 
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•  This  difference  in  behaviour  is  due  to  the  behaviour  of  the  derivatives  of  the  distri¬ 
butions  as  t  =  0.  There  is  a  clear  distinction  between  those  that  have  all  derivatives 
zero  at  regardless  of  parameter  values  and  those  for  which  only  low-order  deriva¬ 
tives  are  zero  (Table  2.1).  Only  the  inverse  Gaussian,  log-normal  and  inverted  gam¬ 
ma  distributions  fall  into  the  first  category. 

•  In  general,  the  inverse  Gaussian  distribution  is  less  peaked  and  falls  less  quickly  as 
f  ->  oo  than  the  log-logistic  and  inverted  gamma  distributions,  with  the  opposite 
applying  for  all  the  other  distributions  (Fig  2.4). 

•  The  Weibull  distribution  is  unique  in  having  regions  of  a/p  where  the  mode  lies  at 
larger  f  values  than  the  mean,  the  skewness  is  negative  and  the  kurtosis  is  less  than 
the  value  of  3.0  that  applies  to  the  Gaussian  (Fig.  2.5).  Of  these  three  features,  one  is 
shared  by  the  log-logistic  distribution,  for  which  fm/p  >  1  at  small  values  of  a/p. 

•  The  Weibull  distribution  is  also  unique  in  that  its  skewness  remains  non-zero  as 
a/p  — » 0  (Fig.  2.5c). 

•  As  a/p  ->  0,  the  kurtosis  of  most  distributions  approaches  3.0,  the  value  for  the 
Gaussian  distribution.  The  Weibull  and  log-logistic  distributions  are  the  two  ex¬ 
ceptions  to  this  behaviour  (Fig.  2.5d).  The  log-logistic  distribution  does  not  have  a 
kurtosis  as  small  as  3.0  for  any  values  of  its  parameters. 

•  The  log-logistic  and  inverted  gamma  distributions  have  markedly  higher  skewness 
and  kurtosis  values  for  a  given  value  of  a/p  than  the  other  distributions.  The  only 
exception  to  this  statement  is  the  P'  distribution,  for  which  the  skewness  can  be 
chosen  over  a  wide  range  as  desired.  The  resulting  values  of  kurtosis  for  the  P'  dis¬ 
tribution  also  span  a  wide  range. 

2.3.2  Limits  on  Ranges  of  Coefficients  of  Variation 

A  significant  outcome  of  the  discussion  in  §2.2  is  the  identification  of  limits  on  the  ac¬ 
cessible  values  of  coefficients  of  variation  a/p.  These  limits  all  bear  on  the  question  of 

which  distributions  are  applicable  to  a  given  application.  Because  of  their  importance, 

the  results  are  collected  in  Table  2.5. 

Table  2.5:  Accessible  ranges  of  coefficients  of  variation  for  the  distributions  considered  in  §2.2. 


Distribution 

range  of  s  =  a/p 

inverse  Gaussian 

no  limit 

Gaussian 

no  limit 

exponential 

S  =  1 

log-normal 

no  limit 

gamma 

s  <  1 

bi-exponential 

l/dl  <  s  <  1 

Weibull 

S<1 

log-logistic 

no  limit 

log-logistic,  finite  skewness 

s  <  0.8087 

log-logistic,  finite  kurtosis 

s  <  0.5227 

inverted  gamma 

no  limit 

inv.  gamma,  finite  skewness 

S<1 

inv.  gamma,  finite  kurtosis 

s  <  l/V2 

beta-prime 

no  limit 

beta-prime,  finite  skewness 

s  <  l/V3 

beta-prime,  finite  kurtosis 

s  <  1/V2 
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2.3.3  Fifth  and  Higher  Moments 

A  recent  study  examined  the  effect  of  varying  the  distribution  of  service  times  in  the 
application  of  queueing  theory  to  the  modelling  of  maritime  interception  operations 
[2].  In  the  course  of  this  study,  it  was  observed  that  the  exponential  distribution  gives 
results  noticeably  different  from  the  other  distributions  studied,  which  included  the 
inverse  Gaussian,  log-normal  and  gamma  distributions,  among  others.  One  may  seek 
an  explanation  for  this  in  the  behaviour  of  the  higher  moments.  The  distributions 
concerned  are  the  inverse  Gaussian  and  log-normal,  since  these  were  the  only  two 
included  in  Ref.  2  that  can  have  a  =  p,  a  property  characteristic  of  the  exponential 
distribution.  The  values  of  skewness  and  kurtosis  of  the  three  distributions  (Table  2.1, 
p.  6;  Figs  2.5c,d)  are  not  different  enough  to  explain  the  effects  observed  in  Ref.  2,  so  it 
is  interesting  to  look  at  yet  higher  moments.  It  turns  out  that  general  expressions  for 
these  are  available  or  can  be  readily  derived. 

The  quantities  of  interest  are  the  rth  moments  pr  about  the  mean  and  the  rth 
moment  ratios  r|r; 

pr  =  J^(f  -  p)r  /(f)  df ,  T)r  =  prM  (2.39) 

the  second  being  the  natural  generalisation  of  skewness  and  kurtosis.  These  quantities 
can  be  readily  evaluated  for  the  exponential  distribution,  giving 

<r — 3), 

k=0  X  ' 

where 

r\_  r! 
kj- k\ (r-k)l 

is  the  binomial  symbol. 

For  the  log-normal  distribution,  Johnson  et  ah  [10(§14.3)]  give  a  formula  the  rth 
moment  about  the  mean.  Setting  a  =  p  in  this  expression,  one  finds 


(2.40) 

(2.41) 


%I-n  =  t  ("I)*  [l)  2(r-*)(r-fc-1)/2  (r  *  3).  (2.42) 

k=0  X  ’ 

For  the  inverse  Gaussian  distribution,  Evans  et  al.  [12]  give  an  expression  for  the 
rth  moment  about  the  origin.  This  can  be  converted  to  the  rth  moment  about  the  mean 
by  a  standard  transformation  [12].  Once  again  setting  a  =  p,  one  obtains 


ryi~1  (r-k +  1-1)1 
h  (r-k-l-l)W.2l 


(r>3). 


(2.43) 


These  three  quantities  are  compared  in  Figure  2.6  up  to  r  =  10.  The  impetus  for 
this  comparison  was  a  study  in  which  use  of  the  exponential  distribution  gave  results 
significantly  different  from  the  other  two  [2].  However,  if  one  distribution  can  be  said 
to  behave  differently  from  the  others  in  Figure  2.6,  it  is  the  log-normal  rather  than  the 
exponential.  Clearly,  the  explanation  of  the  observed  effect  must  be  sought  elsewhere. 
One  possibility  concerns  modes  (Fig.  2.5a):  the  mode  of  the  exponential  distribution  is 
at  f  =  0;  for  the  other  two  it  lies  at  0.303  p  and  0.354  p  (inverse  Gaussian  and  log-normal 
respectively  with  a  =  p).  Another  possibility  is  the  decision  rates,  discussed  in  the  next 
Section. 
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Figure  2.6:  Values  of  the  r th  moment  ratios  r)r  of  the  exponential  distribution  and  the  inverse 
Gaussian  and  log-normal  distributions  with  a  =  p.  Note  the  logarithmic  scale  on  the  ordinate. 
The  points  at  r  =  3  and  r  =  dare  the  skewness  and  kurtosis  respectively.  The  lines  are  intended 
solely  as  guides  to  the  eye. 


2.3.4  Decision  Rate 

In  the  context  of  distributions  of  decision  times,  the  decision  rate  h(t)  is  the  probability 
density  of  a  decision  occurring  at  time  t  on  the  condition  that  a  decision  has  not  occur¬ 
red  at  times  earlier  than  t.  That  is,  it  is  given  by 


{2M) 

where  F(f)  is  the  cumulative  probability  corresponding  to  the  probability  density /(f):  (e) 

F<f)  =  fo  /(f’)dt'  •  (2.45) 


The  quantity  defined  in  Equation  (2.44)  has  also  been  termed  'hazard  rate'  [3,12,13,1 7], 
'failure  rate'  [12,17-19]  and  'force  of  mortality'  [12,13].  It  is  widely  used  as  a  means  of 
classifying  distributions  (e.g.  [10(§33.2),19]). 

Many  of  the  probability  distributions  considered  here  have  surprisingly  simple 
expressions  for  their  decision  rate,  albeit  in  terms  of  a  variety  of  special  functions.  The 
inverse  Gaussian  is  not  one  such;  its  decision  rate  is  [18] 


hiG(t)  = 


_ Ml _ , 

O  [v  (!  -  t/[i)yfp/t  ]  -  elv2  <h  [-V  (1  +  f/p)  Jyfjt] 


(2.46) 


where  O(f)  is  the  cumulative  probability  of  the  Gaussian  distribution  with  mean  of  zero 
and  unit  standard  deviation. 

The  expressions  for  the  decision  rate,  simple  or  otherwise,  are  collected  in  Table 
2.1  (pp.  6-7).  The  point  leading  to  an  argument  for  using  the  inverse  Gaussian  distribu¬ 
tion  to  modelling  decision  speed  concerns  the  behaviour  of  h(t)  over  its  domain  and 
particularly  as  t  -»  oo.  Surveying  the  various  distributions,  we  see  that: 


(e) 


The  lower  limit  of  the  integral  must  be  taken  as  -  oo  for  two-sided  distributions  such  as  the 
Gaussian. 
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•  The  exponential  distribution  has  a  constant  decision  rate  equal  to  1/  p. 

•  All  distributions  other  than  the  exponential  and  Gaussian  have  h(0)  =  0.(f) 

•  The  Weibull  distribution  with  j3  >  1  (the  case  of  interest  in  this  Report)  and  P'  distri¬ 
bution  have  monotonically  increasing  decision  rates  with  no  upper  limit 

•  The  bi-exponential  distribution  and  gamma  distribution  with  r  >  1  also  have  mono¬ 
tonically  rising  decision  rates,  but  they  asymptote  as  t  — »  co  to  h^e  =  a  and  /ir  =  0  = 
b/^2. 

Except  for  the  property  h{ 0)  =  0,  none  of  this  behaviour  seems  realistic  in  a  model  of 
decision  rate.  As  the  following  dot  points  explain,  it  is  likely  that  real-life  decision  rates 
rise  from  zero  at  t  =  0  to  a  maximum  value  and  then  decrease  to  a  non-zero  value  as 
t  — )■  cot 

•  The  rate  should  be  low  initially  while  the  decision  maker  assesses  the  situation. 

•  It  would  then  rise  as  the  decision-maker's  situation  awareness  increases  and  recog¬ 
nition-primed  decisions  [1]  are  made. 

•  This  would  be  followed  by  a  decline  as  the  'easy'  decisions  are  taken,  leaving  the 
more  difficult  ones  to  be  dealt  with. 

•  However,  the  decision  rate  ought  not  to  decline  to  zero;  while  ever  a  decision  re¬ 
mains  to  be  made,  one  expects  a  non-zero  probability  that  it  will  be  made. 

Of  the  distributions  in  Table  2.1,  only  the  inverse  Gaussian  distribution  has  a  decision 
rate  that  rises  from  zero,  peaks  and  then  declines  to  a  non-zero  value  as  t  -»  co.  That  is, 
only  the  inverse  Gaussian  distribution  fulfils  all  of  the  requirements.  This  is  somewhat 
remarkable,  in  view  of  the  diversity  of  distributions  compiled  in  Table  2.1. 

Figure  2.7  shows  the  time  dependence  of  the  decision  rate  for  three  distributions 
with  c  =  p,  being  the  three  distributions  discussed  in  the  previous  Section.  Decision 
rate  provides  another  aspect  that  distinguishes  the  exponential  distribution  from  the 
other  two. 


Figure  2.7;  Time  dependence  of  decision  rates  hft )  for  three  distributions  with  o  =  p.  To  exploit 
scaling  belmviour,  phft)  is  plotted  against  t/p.  The  product  phft )  for  the  log-normal  distribu¬ 
tion  asymptotes  to  zero  as  t  — >  co;  that  of  the  inverse  Gaussian  distribution  asymptotes  to  0.5  (in 
the  case  a  =  p). 


(f)  For  the  gamma  and  Weibull  distributions,  this  is  true  only  for  parameter  values  of  interest  in 
this  report,  i.e.  those  giving  s  <  1.  (When  s  =  1,  both  distributions  degenerate  to  the  exponen¬ 
tial  distribution  and,  for  s  >  1,  both  start  at  h( 0)  =  oo  and  fall  monotonically  as  t  — >  co.) 
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3.  Finite-Domain  Distributions 

The  initial  interest  in  distributions  on  a  finite  domain  arose  from  considerations  of  mo¬ 
delling  decision  soundness,  since  soundness  scales  typically  are  finite.  If  the  scale  is 
continuous,  the  p  distribution  is  useful  because  of  the  wide  range  of  distribution  shape 
available.  Often,  soundness  scales  are  discrete  ('Likert  scales'),  so  there  is  interest  in 
discrete  probability  distributions  from  that  point  of  view.  Both  these  topics  are  discuss¬ 
ed  in  this  Section,  but  first  the  following  observation  is  addressed. 

Table  2.1  and  Figure  2.5(c)  show  that  the  Gaussian  distribution  has  a  kurtosis  of 
3,  independent  of  the  value  chosen  for  the  standard  deviation,  and  that  almost  all  other 
distributions  have  larger  kurtosis.  The  sole  exception  is  the  Weibull  distribution  for  a 
limited  range  of  values  of  p,  and  even  there  the  minimum  value  available  is  -2.71,  not 
much  less  than  3.  In  the  interests  of  exploring  a  wide  range  of  behaviour,  it  may  be  use¬ 
ful  on  some  occasions  to  use  distributions  with  markedly  lower  kurtosis  than  that  of 
the  Gaussian,  even  if  their  other  properties  make  them  unlikely  candidates  for,  say,  a 
realistic  description  of  decision  times. 

3.1  Simple  Low-Kurtosis  Continuous  Distributions 

This  Section  presents  properties  of  three  simple  distributions,  two  of  which  clearly 
have  rather  lower  kurtosis  that  the  Gaussian  distribution.  Their  definitions  are  given 
below  and  their  properties  are  collected  in  Table  3.1. 

3.1.1  Impulse  Distribution 

The  impulse  or  deterministic  distribution  has  a  single  parameter,  its  mean  p,  and  is 
defined  by 

/i(f)  =  8(f-p),  (3.1) 

where  8(x)  is  the  impulse  symbol,  also  known  as  the  Dirac  delta  function.  The  effect  of 
Equation  (3.1)  is  that  p  is  the  only  possible  value  for  f;  the  probability  of  obtaining  any 
other  value  is  strictly  zero.  The  corresponding  cumulative  probability  (Eqn  (2.45))  is 
the  unit  step  function  located  at  t  =  p. 


Table  3.1:  Properties  of  three  simple  probability  density  functions  with  finite  domain. 


Impulse 

Uniform 

Triangular 

Range  of  s  =  cj/p 

s  =  0 

s  <  1/V3 

s  <  1/V8  to  1/V2* 

Mode  tm 

P 

none 

Skewness 

0 

0 

Eqn  (3.7) 

Kurtosis  ti4 

undefined! 

9/5 

12/5 

r  2  (t-a) 

(a<t<tm) 

(tm<t<b) 

Decision 
rate  h(t)$ 

8(t-p) 

~  (a<t<b) 
b-t 

0  (otherwise) 

(b-a)tm-t2  +  2  at  -  ab 

2 

b-t 

0 

(otherwise) 

*  Depending  on  TI3,  see  §3.1.3.  +  See  §3.1.1.  1  See  §2.3.4. 
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It  may  seem  that  this  distribution  has  the  smallest  possible  kurtosis,  namely  zero, 
since  all  moments  about  the  mean  are  zero.  However,  the  kurtosis  is  defined  as  the 
ratio  of  two  such  moments,  the  fourth  to  the  square  of  the  second,  and  it  turns  out  that 
the  result  is  strictly  undefined.  To  see  this,  consider  the  common  method  of  introduc¬ 
ing  the  impulse  symbol  using  a  limiting  sequence  of  appropriate  functions  [e.g.  20(ch. 
5)].  Choosing  a  sequence  of  rectangles  gives  a  kurtosis  of  9/5;  a  sequence  of  triangles 
gives  a  value  of  12/5;  one  of  Gaussians  gives  3,  etc.  Since  all  these  sequences  are  held 
to  represent  exactly  the  same  object,  the  impulse  symbol,  it  follows  that  its  kurtosis  is 
not  a  well-defined  quantity,  (g)  Nevertheless,  there  is  no  doubt  that  the  impulse  distri¬ 
bution  represents  an  extreme  of  behaviour  among  probability  density  functions,  and 
this  in  itself  makes  it  useful  in  some  circumstances. 


3.1,2  Uniform  distribution 


The  uniform  distribution,  perhaps  the  simplest  of  all  distributions,  has  a  kurtosis  of  just 
9/5,  the  lowest  encountered  in  this  work  among  the  continuous  distributions. (U  The 
probability  density  is 


/«(*)  = 


b-a 


(a<t<b) 


(3.2) 


[0  (otherwise) 


where  we  require  a  >  0  to  ensure  that/u(0)  =  0.  The  mean  and  standard  deviation  are 


a  +  b 


a  = 


b-a 

7vi ' 


(3.3) 


Rewritten  using  a,  p  as  parameters.  Equation  (3.2)  is 


/«(*)  = 


'  1 
2cW3 
0 


(p  — cW3  <f  <p  +  CT^) 
(otherwise). 


(3.4) 


Hence,  the  requirement  a  >  0  places  a  limitation  on  the  values  of  s  =  c/p  that  can  be  ob¬ 
tained:  s  <  1/V3.  The  uniform  distribution  does  not  have  a  mode  —  it  has  no  peak.  Other 
properties  are  listed  in  Table  3.1. 


3.1.3  Triangular  Distribution 

The  triangular  distribution  has  probability  density  that  is  zero  for  t  <  a,  rises  linearly 
from  t  =  a  to  the  mode  fm,  falls  linearly  from  t  =  tm  to  zero  at  f  =  b  and  is  zero  thereafter: 


2  (t-a) 
(b-a)(tm-a) 


(a<t<tm) 


m= 


2  (t-a) 


(b-a)(tm 

0 


(tm<t<b) 


(otherwise). 


(3.5) 


(g)  The  skewness  of  the  impulse  distribution  is  well  defined.  All  valid  limiting  sequences  must 
be  sequences  of  functions  that  are  symmetric  about  p,  and  hence  all  agree  that  the  skewness 
is  zero. 

(h)  Only  the  discrete  uniform  (§3.3,1)  and  p-binomial  distributions  (§§3,3.4  and  3.3.5)  have  lower 
accessible  kurtosis  values. 
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where  a  >  0  to  keep  /t(0)  =  0.  Equation  (3.5)  is  a  generalisation  of  the  frequently  antho- 
logised  symmetric  triangular  distribution,  which  is  obtained  by  setting  tm  =  (a  +  b)/2. 
The  mean  and  standard  deviation  of  the  general  distribution  are  listed  in  tabulations 
[7,12],  but  the  higher  moments  seem  not  to  have  been  previously  published.  The  mean 
and  variance  are 


bt  = 


a  +  tm  +  b 


o  9  9 

2  _a  +fm  +k  - flfm  - ab -ht, 

at  ~ - 


m 


3  1  18 

Straightforward,  though  lengthy,  calculation  gives  the  skewness  as 

_  2(a3  +  tm3  +  b3)  -3(a2tm  +  atm2  +  a2b  +  ab2  +  b2tm  +btm2)  +  llabt^ 

rj3t - - - 

270a3 


(3.6) 


(3.7) 


The  kurtosis  is  r|4t  =  12/5  regardless  of  the  parameter  values.  The  r|4t  value  is  well  known 
for  the  symmetric  triangular  distribution,  but  it  does  not  seem  to  have  been  previously 
recognised  that  it  applies  in  general/1)  It  is  a  most  remarkable  result.  It  means  that  the 
skewness  can  be  varied  while  holding  mean,  standard  deviation  and  kurtosis  constant, 
despite  the  fact  that  the  triangular  distribution  has  just  three  parameters.  This  property 
seems  to  be  unique  among  probability  distributions.  It  recently  proved  useful  in  a 
study  of  the  effect  on  a  queueing  system  of  varying  service-time  distributions  [2], 

The  inversion  of  Equations  (3.6)  and  (3.7)  is  algebraically  challenging,  but  was 
performed  with  the  aid  of  a  computer  algebra  package.  The  result  is 

fl  =  pt  cos^-atV6sin^, 

fm  “bt -crt>/2cosl;  +  atA/6sin£,  (3.8) 

b  =  pt  +2atA/2cos^, 


where  the  auxiliary  angle  is  given  by 


5  = 


— arctan 
3 


V8-25ri3t2 

5rl3t 


(3.9) 


To  evaluate  Equation  (3.9)  appropriately,  the  result  of  the  arctan  function  must  be 
taken  in  the  range  [0,n],  which  is  not  the  usual  range  assumed  by  calculators  and 
software  routines.  That  is,  ri3t  =  0  corresponds  to  6  =  n/6  and  negative  values  of  ri3t 
correspond  to  n/6  <^<  n/3. 

In  the  course  of  the  derivation  of  Equations  (3.8)  and  (3.9),  one  finds  that  the 
available  range  of  skewness  values  is  limited  to 


(3.10) 


unlike  the  case  of  the  P'  distribution,  negative  skewness  is  accessible.  The  limits  cor¬ 
respond  to  the  'sawtooth  distributions':  r|3t  =  -Vs/5  (£,  =  n/3)  when  fm  =  b  and  r|3t  = 
+V8/5  (6,  =  0)  when  fm  =  a.  The  first  line  of  Equation  (3.8)  implies  a  limit  on  the  value 
of  a  from  the  requirement  that  a  >  0: 


(0  Johnson  et  al.  [10(§26.9)]  quote  a  formula  for  general  moments  about  tm.  Ayyangar  [21]  gives 
an  expression  for  moments  about  the  mean  in  terms  of  moments  about  fm,  but  evaluates  it 
for  the  symmetric  distribution  only. 
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s,=a<£2?f£«±VS.  (3.11) 

pt  V8 

That  is,  the  value  of  the  limit  depends  on  the  chosen  skewness:  the  limit  is  quite  low  for 
maximum  negative  skewness  (st  <  1  A/8  when  r|3t  =  — n/s/5)  and  rises  as  r|3t  is  increas¬ 
ed,  reaching  st  <  1/V2  when  T}3t  =  +V8/5 . 


3.2  The  General  Beta  Distribution 


The  general  J3  distribution  is  a  very  general  finite-domain  continuous  distribution. 
Having  four  parameters,  a  wide  range  of  curve  shapes  is  encompassed.  Its  standard 
form  is  [10(ch.25)] 


/p(*H 


\t-a)P-1(b-t)1 

B(p,q)(b-af+cI 


-1 


( a<t<b ) 


(3.12) 


[o 


(otherwise). 


where  B{p,q)  is  the  beta  function  (Eqn  (2.30))  and  p,  q  must  both  be  positive.  We  consi¬ 
der  only  those  distributions  that  are  finite  at  both  end  points,  which  implies  p>l,q>l. 
In  addition,  we  require  a  >  0,  as  usual.  The  case  p  =  1,  q  =  1  is  exactly  the  uniform  distri¬ 
bution  on  [a,b],  and  the  cases  p  =  l,q  =  2  and  p  =  2,  q  =  1  are  the  two  sawtooth  distribu¬ 
tions  mentioned  in  §3.1.3.  Obviously,  the  kurtosis  of  the  (3  distribution  is  low  for  at 
least  part  of  its  parameter  space. 

The  properties  of  the  p  distribution  are  well  known  [10(ch,25),13(ch.l4)].  The 
mean  p  and  variance  a2  are 


P 


=  a  + 


(b-a)p 
p  +  q 


c2  Q’-afpq 

(p  +  qf(p  +  q  +  l)' 


(3.13) 


the  skewness  ri3  and  kurtosis  r|4  are 

2 (g-p)  fp+qT T  _Hp  +  q  +  1)[2(p-‘j)Z+W(P  +  cI  +  2)]  . 

P  +  q  +  2y  pq  '  T*4  pq(p  +  q  +  2)(p  +  q  +  3) 


and,  for  the  case  p>l,q>l,  the  mode  tm  is 


tm  -a  + 


(b-a)(p- 1) 

p+q- 2 


(3.15) 


Following  the  theme  of  this  Report,  we  wish  to  invert  Equations  (3.13)  and  (3.14)  to  ob¬ 
tain  expressions  for  a,  b,  p,  q  in  terms  of  p,  ct,  ti3,  TI4.  Of  the  distributions  included  in 
this  Report,®  the  P  distribution  is  the  only  one  where  such  expressions®)  have  been 
found  elsewhere  [10(§25.4))].  In  terms  of  the  auxiliary  variables 


0)  See  also  footnote  (b)  (p.  11). 

®)  As  Johnson  et  ah  note  [10(§25.4)],  sometimes  values  of  a,  b  are  set  by  external  considerations 
and  the  requirement  is  to  choose  p,  q  to  give  desired  values  of  p,  ct.  This  is  a  relatively  simple 
problem,  the  solution  to  which  is 


P  = 


p-fl 

~(p-a)(b-p) 

a-^ 

'(p-o)(b-p) 

b-a 

s2  J 

b-a 

L  CT2 
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r  = 


6  T14-H32-1 
3il32-2Tl4+6 


(r  +  2)r|3 

V(r  +  2)2ti32  +16(r  +  1) 


(3.16) 


the  expressions  for  p,  q  are 

p  =  r(l-R)/2,  q  =  r(l  +  R)/2. 


(3.17) 


A  third  auxiliary  variable  facilitates  expressions  for  a,  b: 

A  =  ^yl(r  +  2)2j]i  +  16(r  +  l);  =  &=A+a.  (3.18) 


What  is  missing  from  the  discussion  of  Ref.  10  is  a  description  of  the  accessible 
range  of  skewness,  kurtosis  and  coefficient-of-variation  values.  This  turns  out  to  be  a 
complicated  problem,  with  many  factors  potentially  contributing  to  the  boundaries  of 
the  accessible  parameter  space.  For  example,  both  A  and  R  must  be  real,  which  means 
that  t|3,  ri4  are  limited  to  values  for  which  the  square-root  functions  have  non-negative 
arguments.  Other  requirements  are  p  >  1  and  q>  1.  Figure  3.1  shows  an  accessible  area 
in  t|3-r|4  space.  The  upper  boundary  is  part  of  the  line  of  singularity  of  r;  the  lower 
boundary  comes  from  the  condition  that  p>  1  and  q>  1.  That  is,  the  solid  line  in  Figure 
3.1  is  given  by 

t|4  =  3  +  3r|32/  2,  (3.19) 

the  condition  for  which  the  denominator  of  r  is  zero.  The  requirements  p  >  1  and  q  >  1 
generate  a  cubic  form  in  q32  and  q4.  Treated  as  an  equation  in  114,  this  has  three  real 
roots,  the  largest  of  which  for  any  given  q32  value  is  shown  as  the  broken  line  in  Figure 
3.1.  The  equation  of  this  line  is 


%  =Mcosip+Mj3sin<(.+  60  +  54t|32  ~3t|34/8, 

50-r,32 

where 


w 

(t132  -f 

-100)(t|3 

2  +  4  f 

8 

50  -r,32 

9 


=— arctan 
3 


32r|3(r|32  -  50) 
2000-360t132+t|34  ' 


(3.20) 


(3.21) 


Figure  3.1:  An  area  of  accessible  values  ofr |3  and  r\4  for  the  general  p  distribution,  shown  as  the 
region  between  the  two  lines.  It  is  conjectured  that  this  is  the  only  such  area. 
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Although  not  proven,  it  is  conjectured  on  the  basis  of  algebraic  analysis  and  some  trial- 
and-error  evaluations  that  the  region  between  the  lines  in  Figure  3,1  is  the  only  area  of 
accessible  TI3  and  r| 4  values  for  the  general  p  distribution.  The  highest  values  are  |r[3  j  = 
2,  r)4  =  9,  like  the  exponential  distribution.  The  lowest  kurtosis  accessible  is  r|4  =  9/5, 
which  can  only  be  achieved  when  pg  =  0;  in  other  words,  this  is  exactly  the  uniform 
distribution. 

The  question  of  the  range  of  a/p  values  accessible  has  not  been  explored,  but 
it  can  be  expected  that  the  condition  a  >  0  places  an  upper  limit  on  a/  p,  the  value  of 
which  is  a  function  of  t]3  and  114. 


3.3  Discrete  Probability  Distributions 

The  use  of  a  discrete-valued  scale  of  measurement  is  common  in  the  surveys  used  to 
gather  data  on  decision  soundness  in  specific  situations.  Hence,  there  is  interest  in  dis¬ 
crete  probability  distributions  for  modelling  decision  soundness.  Because  the  scales 
used  often  have  few  levels,  for  example  0-5  or  1-7,  there  is  no  necessity  to  employ  a 
formula-based  distribution:  it  would  be  feasible  to  regard  a  scale  with  n  +  1  levels  as 
having  n  parameters— namely  the  probabilities  of  obtaining  each  of  n  of  the  levels  in 
any  given  trial— subject  only  to  the  restriction  that  the  sum  of  the  probabilities  for  all 
n  +  1  levels  must  be  unity.  However,  the  degree  of  freedom  entailed  by  this  approach 
can  be  uncomfortably  large,  giving  rise  to  a  desire  for  a  distribution  with  a  small  num¬ 
ber  of  parameters.  There  is  evidently  no  theoretical  basis  for  choosing  any  particular 
probability  distribution  for  the  purpose  of  modelling  decision  soundness,  so  this  Sec¬ 
tion  compares  the  properties  of  several  discrete  probability  distributions. 

At  first  sight,  there  seem  to  be  few  candidates;  most  references  list  just  three 
discrete  distributions  with  finite  domain:  the  uniform,  binomial  and  hypergeometric. 
More  comprehensive  texts  [6,12,22,23]  mention  a  fourth,  the  P-binomial  distribution, 
which  is  not,  in  fact,  independent  of  the  other  three.  It  is  unsurprising  to  learn  that 
there  are  many  more,  but  these  are  treated  only  in  journals,  specialised  texts  (e.g,  [24]) 
and  the  most  comprehensive  compilations  on  discrete  distributions  [25,26].  Since  these 
other  distributions  are  less  known,  only  the  four  distributions  named  above  are  treated 
here  in  any  detail.  Of  these,  the  uniform  distribution  is  of  little  interest,  as  explained  in 
§3.3.1.  The  main  properties  of  the  other  three  are  collected  in  Table  3.2, 

Section  3.3.6  lists  a  few  properties  of  some  of  the  lesser  known  distributions,  pri¬ 
marily  as  a  cursory  indication  of  the  range  of  distributions  that  have  been  studied.  For 
many  of  these,  our  program  of  expressing  parameters  in  terms  of  moments  cannot  be 
carried  through  because  either  there  are  no  closed-form  expressions  for  the  moments, 
or  the  expressions  that  exist  are  algebraically  intractable, 

3.3.1  Discrete  Uniform  Distribution 
The  discrete  uniform  distribution  is© 

W„(0=^T  (0  <!<„).  (3.22) 

Often,  n  is  determined  by  external  factors,  such  as  the  nature  of  the  particular  Likert 
scale  used;  in  these  cases,  this  distribution  can  be  regarded  as  having  no  parameters. 

©  The  lower  limit  of  the  domain  is  taken  as  zero  rather  than  the  more  usual  value  of  unity  in 
the  interests  of  compatibility  with  the  other  discrete  distributions  treated  herein. 
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Table  3.2:  Properties  of  three  discrete  probability  density  functions  defined  on  0  <t  <n. 


Binomial 

Hypergeometric 

Beta  binomial 

Mean  p 

na 

nM 

~N~ 

na 

a  +  b 

Mode  fm* 

L(n  +  l)aJ 

(n  +  1)(M  +  1) 
N  +  2 

(n  +  l)(«-l) 
ci  +  b  —  2 

Variance  a2 

na(l  -  a) 

nM(N -n)(N  -  M) 

nab(a  +  b  +  n ) 

N2(N- 1) 

(a  +  b)2  (a  +  b  + 1) 

Skewness 

1-2  a 

(N -2n)(N -2M) 

(b-a)(a  +  b  +  2n) 

F3 

CTb 

N(N- 2)ah 

+  +  2)<jj 

p-b 

N(N  +  \)-6n(N-n 

L 

a2  -4ab  +  b2  - a-b+3nb(a  +  b  +  n) 

Kurtosis  r|4 

CTb  n 

(N-2)(N-3)ah2 
3(V-l)(AT  +  6)  6M(N-M) 

(a  +  b  +  +  b  +  3)ap_t>2 

6(a  +  &  +  l)^fl2  -ab  +  b2^ 
ab(a+b  +  2)(a  +  b  +  3) 

(N-2)(N-3)  (N  -  2)(N  -  3)ah2 

*  [xj  =  largest  integer  <x.  If  x  is  exactly  integral,  then  there  are  two  equal  maximum  values,  at  [xj 
and  [xj  - 1.  If  an  expression  exceeds  n,  then  the  mode  is  at  n;  if  less  than  zero,  then  it  is  at  zero. 


This  point  of  view  is  adopted  here.  Hence,  the  distribution  is  of  little  interest  in  the  pre¬ 
sent  context,  since  there  is  nothing  to  adjust.  For  reference,  its  moments  are: 

Pdu~n/2-'  adu  =  n(n  +  2)/12,  r]3du  —  0/  h4du  =9/5  —  j.  (3.23) 

3.3.2  Binomial  Distribution 

The  binomial  distribution  has  a  single  parameter  a  satisfying  0  <  a  <  1.  The  distribution, 
defined  for  integer  t  in  the  range  [0,n],  is 

Pb(0  =  (”)°'(l-« (3.24) 

where  the  first  bracketed  factor  on  the  right-hand  side  is  the  binomial  symbol  (Eqn 
(2.41),  p.  16).  As  Table  3.2  indicates,  the  parameter  a  equals  pb/n.  This  leads  to  simple 
expressions  for  variance,  skewness  and  kurtosis  in  terms  of  the  mean;  values  are 
shown  in  Figure  3.2  for  n  =  10.  As  this  Figure  indicates,  the  full  range  of  skewness, 
positive  and  negative,  occurs.  It  is  interesting  that  the  minimum  value  of  kurtosis, 
which  occurs  when  |q,  =  n/2,  equals  3  -  2/n,  a  little  less  than  the  value  of  3  applying  to 
the  Gaussian  distribution. 


3.3.3  Hypergeometric  Distribution 

The  standard  form  of  the  hypergeometric  distribution  reads  [6] 


-1 


(3.25) 


where  0  <  t  <  n.  The  parameters  M  and  N  must  both  be  integers  and  satisfy  M>n,N> 
M  +  n.  The  requirement  for  M,  N  to  be  integers  leads  to  a  complication  in  the  inversion 
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Figure  3,2:  Standard  deviation,  skewness  and  kurtosis  of  the  binomial  distribution  as  functions 
of  the  mean  for  n  =  10.  Only  the  left  half  of  the  domain  is  shown;  the  standard  deviation  and 
kurtosis  are  symmetric  about  \ib  =  5,  the  skewness  antisymmetric. 


of  the  expressions  for  p,  a.  The  expressions  (Table  3.2)  can  be  readily  inverted  if  N,  M 
are  treated  as  real  numbers,  called  N,  M  in  the  following  equations: 


N  =  n  M  =  u  H&Ztihgl 

p .(w-p)-ncn  p(tt-p)-ttcr 


(3.26) 


To  complete  the  inversion,  N,  M  must  be  converted  to  integers  to  obtain  N,  M  respec¬ 
tively,  but  it  is  not  clear  how  to  formulate  a  rule  for  rounding  that  will  lead  to  p  and  a 
values  as  close  as  possible  to  the  desired  values.  Presumably  each  instance  should  be 
treated  on  its  merits,  using  trial  and  error. 

The  question  of  the  accessible  ranges  of  p,  a  turns  out  to  reveal  an  aspect  of  the 
connection  between  this,  the  binomial  and  the  p-binomial  distributions.  Hence,  it  is 
deferred  to  §3.3.5,  where  the  accessible  ranges  of  skewness  and  kurtosis  for  the  hyper¬ 
geometric  and  P-binomial  distributions  are  also  determined. 


3.3.4  Beta-Binomial  Distribution 
The  P-binomial  distribution  is  [6,12(p.3 7)] 


B(a  +  t,b  +  n-t) 
B(a,b) 


(3.2 7) 


where  once  again  0  <  t  <  n.  The  parameters  a,  b  need  not  be  integers  but  both  must  be 
greater  than  zero.  The  function  B(a,b )  is  the  standard  beta  function  (Eqn  (2.30),  p.  11), 
which  is  a  generalisation  of  the  binomial  symbol  to  non-integer  arguments.  This  distri¬ 
bution  is  also  known  as  the  negative  or  inverse  hypergeometric  distribution  [22(pp. 
155-60),23(pp.330-2),26(§6.2.2),27],  a  terminology  that  reflects  the  connection  between 
the  two  distributions  [26(§6.2.2),2 7], 

Inversion  of  the  equations  for  mean  and  variance  (Table  3.2)  gives  results  rather 
similar  to  Equation  (3.26),  another  indication  of  the  connections  between  the  two  distri¬ 
butions.  Here,  we  have 
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a- p 


p(n-p)-c2 

2  ' 
no  -p(n-p) 


b 


=  (n-\i) 


p(n-p)-g2 

2  ' 
no  -p(n-p) 


(3.28) 


but  this  time  the  expressions  can  stand  as  they  are,  since  a,  b  are  not  required  to  be 
integers.  The  question  of  the  accessible  ranges  of  p  and  a  is  again  deferred  to  the  next 
Section,  which  also  deals  with  the  ranges  of  skewness  and  kurtosis  accessible  to  this 
distribution. 


3.3.5  Comparisons  and  Ranges  of  Coefficient  of  Variation 
Figure  3.3  illustrates  the  three  distributions  of  interest.  This  figure  differs  from  Figure 
2.4  in  that  the  latter  shows  distributions  with  the  same  mean  and  standard  deviation. 
The  distributions  in  Figure  3.3  all  have  the  same  mean,  but  different  standard  devia¬ 
tions.  In  fact,  with  finite  parameter  values  it  is  not  possible  for  any  of  the  three  discrete 
distributions  to  have  simultaneously  the  same  mean  and  standard  deviation,  as  the 
following  argument  shows.  If  one  expresses  each  standard  deviation  a  in  Table  3.2  in 
terms  of  the  corresponding  mean  p,  one  obtains 


2 


N-n 
N-l  ’ 


Fp-b^l  a  +  b  +  n 
n  J a+b+1 ' 


(3.29) 


Now  n  >  1  in  all  cases  and  N  >  n  for  the  hypergeometric  distribution,  so  it  follows  that 
distributions  with  equal  means  have  ab  <  ab,  with  the  equality  being  approached  as 
N  — »  oo.  On  the  other  hand,  the  P-binomial  distribution  shows  the  opposite  behaviour— 
op.b  >  cjb— since  a  and  b  are  both  positive;  equality  is  approached  as  one  of  a  or  &  goes 


Figure  3.3:  Discrete  distributions  with  a  maximum  range  ofn  =  10  and  a  mean  of\x=  7.5.  The 
standard  deviation  of  the  binomial  distribution  is  fixed  by  the  mean  and  range  to  be  Vi. 8 75  = 
1.369.  The  standard  deviations  of  the  other  two  distributions  were  chosen  as  ah  =  1.20  and  ap_b 
=  1.50.  (See  text  for  discussion  of  these  a  values.) 
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to  infinity.  In  summary,  distributions  with  equal  means  have  their  standard  deviations 
ordered  as  <  cq,  <  crp.^,  with  strict  inequahties  for  finite  parameter  values.  Hence,  the 
three  discrete  distributions  are  not  alternatives  in  the  sense  that  the  various  continuous 
distributions  are;  which  is  chosen  depends  on  the  mean  and  standard  deviation  that 
one  wishes  to  model. 

The  other  restrictions  on  the  parameters  lead  to  limits  on  the  standard  deviations 
attainable  with  the  hyper  geometric  and  p-binomial  distributions  that  mirror  those 
given  above.  In  the  case  of  the  hypergeometric  distribution,  N  has  a  lower  limit  deter¬ 
mined  by  the  values  of  ph  and  n.  This  gives  a  lower  limit  on  the  standard  deviation: 


crh  >Phx 


n-P-  h 


y[bh\ 

(«2-Fh) 

r* 

[-Fh 

n2-n  +  \i  h 


(ph  <n/2) 


(ph>n/2). 


(3.30) 


For  the  p-binomial  distribution,  the  parameter  b  can  be  regarded  as  determined  by  the 
values  of  a,  pp_b  and  n.  The  lower  limit  on  a  (a  >  0)  then  places  an  upper  limit  on  the 
standard  deviation: 

°p-b  <  Fp-b  Jn/P p-b-1  (3-31) 


The  various  limits  on  the  standard  deviations  are  depicted  as  a  function  of  the  mean  in 
Figure  3.4  for  the  case  of  n  =  10.  In  view  of  the  relatively  wide  range  of  standard  devia¬ 
tion  accessible  to  the  P-binomial  distribution,  it  deserves  to  be  better  known.  It  should 
be  noted  that,  although  the  region  accessible  to  the  hypergeometric  distribution  is 
shown  as  a  structureless  area  in  Figure  3.4  (the  area  with  darker  shading),  in  fact  it  con¬ 
sists  of  a  lattice  of  isolated  points,  owing  to  the  requirement  that  the  parameters  N,  M 


0  2.0  4.0  6.0  8.0  10.0 


mean 

Figure  3.4:  Relationship  betiveen  standard  deviation  and  mean  for  the  three  discrete  distribu¬ 
tions  ivith  n  =  10.  Tire  heavy  line  shows  the  dependence  of  standard  deviation  on  mean  for  the 
binomial  distribution;  the  shaded  areas  show  the  accessible  values  for  the  other  distributions. 
The  accessible  region  for  the  hypergeometric  distribution  is  actually  a  lattice  of  isolated  points, 
since  the  parameters  of  that  distribution  must  be  integers;  the  density  of  tire  lattice  increases 
toward  the  heavy  tine. 
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be  integers.  Since  N  —*  co  corresponds  to  approaching  the  heavy  line  in  Figure  3.4  from 
below,  the  lattice  of  points  increases  in  density  toward  the  heavy  line. 

All  three  distributions  have  positive  skewness  if  p  <  n/2,  zero  skewness  when  p  = 
n/2  and  negative  otherwise,  as  expected.  The  behaviour  of  the  skewness  and  kurtosis 
of  the  binomial  distribution  is  shown  in  Figure  3.2.  Corresponding  plots  for  the  other 
two  distributions  are  more  complicated  because  their  skewness  and  kurtosis  do  not 
depend  on  the  coefficient  of  variation  a/p  alone.  That  is,  r|3  and  r|4  for  the  hypergeo¬ 
metric  and  P-binomial  distributions  are  functions  of  two  variables  with  domains  of 
definition  shown  as  the  shaded  areas  in  Figure  3.4.  The  p-binomial  distribution  is  con¬ 
sidered  first,  since  it  has  a  well  defined  parameter  inversion  (Eqn  (3.28)). 

Figure  3.5  shows  contour  plots  of  skewness  and  kurtosis  for  the  p-binomial  distri¬ 
bution.  The  broken  lines  in  this  Figure  outline  the  domain  of  definition,  which  is  the 
lighter-shaded  area  in  Figure  3.4.  Only  the  left  half  of  the  domain  is  shown;  skewness  is 
antisymmetric  about  p  =  n/2  and  kurtosis  is  symmetric.  As  Figure  3.5  indicates,  both 
skewness  and  kurtosis  increase  without  limit  as  pp_b  — >  0.  In  terms  of  distribution  para¬ 
meters,  this  corresponds  to  a  — >  0.  The  skewness  is  zero  when  pp_b  =  n/2,  that  is,  when 
b  =  a.  The  smallest  kurtosis  value  also  lies  on  this  line  and  at  the  top  of  the  domain  of 
definition,  which  corresponds  to  the  parameters  a  — »  0,  b  — >  0.  The  value  of  this  mini¬ 
mum  is  1.0,  the  smallest  encountered  in  this  work  for  any  distribution,  whether  contin¬ 
uous  or  discrete,  finite  or  infinite.  In  fact,  as  one  traces  the  line  pp_b  =  n/2  from  bottom 
(a  =  oo)  to  top  (a  =  0),  the  kurtosis  falls  from  the  binomial-distribution  value  of  3  -  n/2 
(=  2.8  for  n  =  10)  to  unity.  That  is,  low  kurtosis  is  found  along  the  entire  line  given  by 
Pp-b  =  n/2  and  there  is  a  region  at  the  top  of  the  domain  of  definition  where  the  kurto¬ 
sis  is  smaller  than  the  value  of  1.2  applying  to  the  uniform  distribution. 

Figure  3.6  shows  corresponding  results  for  the  hypergeometric  distribution.  This 
Figure  was  prepared  using  Equations  (3.26)  and  taking  N  =  N,  M  =  M,  that  is,  ignoring 
the  discretisation  of  the  domain.  As  with  the  p-binomial  distribution,  the  skewness  of 
the  hypergeometric  distribution  is  zero  on  the  line  pb  =  n/2  and  rises  without  limit  as 
pb  — »■  0.  The  inset  indicates  this  rather  schematically.  (In  terms  of  parameters,  pb  — »  0 
corresponds  to  N  — >  oo.)  The  kurtosis  also  rises  without  limit  as  pb  — >  0.  On  the  line  pb  = 


Figure  3.5:  Contours  of  constant  (a)  skewness  and  (b)  kurtosis  for  the  p- binomial  distribution 
xvith  n  =  10.  The  broken  lines  delineate  the  domain  of  definition,  which  is  the  same  as  the  lighter 
shaded  area  in  Figure  3.4.  Only  the  left  half  of  the  domain  is  shoivn;  the  skewness  is  antisym¬ 
metric  about  pp_&  =  5,  the  kurtosis  symmetric. 
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0  1.0  2.0  3.0  4.0  5.0  0  1.0  2.0  3.0  4.0  5.0 


mean 

Figure  3.6:  As  in  Figure  3.5,  but  for  the  hypergeometric  distribution.  In  this  case,  the  domain  of 
definition  is  the  darker  shaded  area  in  Figure  3.4.  The  inset  in  each  panel  shows  detail  near  the 
origin.  Because  the  upper  and  lower  domain  boundaries  are  so  close  to  each  other  in  this  region, 
contours  are  not  discemable  in  the  insets;  they  lie  near  to  the  numbers. 

n/2  (corresponding  to  N  =  2 M),  the  kurtosis  is  a  minimum  at  the  top  of  the  domain, 
equalling  the  value  for  the  binomial  distribution  (i.e.  3  -  n/2  or  2.80  for  n  =  10).  The 
kurtosis  rises  slowly  as  one  moves  down  along  the  line  p^  =  n/2. 

3.3.6  Less  Well  Known  Distributions 

This  Section  briefly  describes  several  other  discrete  distributions  with  a  finite  domain 
[0 ,n].  The  study  of  some  of  these  has  a  long  history,  but  they  nevertheless  do  not  ap¬ 
pear  in  most  compilations  of  distributions.  The  purpose  of  this  Section  is  to  promote 
awareness  of  the  distributions.  The  expression  of  their  parameters  in  terms  of  their 
moments  is  carried  through  only  where  it  is  simple. 

(a)  Distributions  zoith  Zero  Parameters 

As  explained  in  §3.3.1,  frequently  the  range  n  of  the  distribution  domain  is  set  by  exter¬ 
nal  considerations;  it  is  not  available  for  variation.  Hence,  distributions  with  n  as  the 
only  variable  other  than  the  argument  are  considered  in  this  Report  to  have  no  parame¬ 
ters.  Lest  it  be  thought  that  the  discrete  uniform  distribution  is  the  only  one  such,  three 
others  are  described  here. 

The  ''classical  matching  distribution'  is  of  some  antiquity.  It  is  the  answer  to  the 
following  question:  n  entities  are  numbered  consecutively  1 ,...,«  and  are  then  rearrang¬ 
ed  at  random.  What  is  the  probability  that  t  of  them  will  have  a  position  in  the  random 
sequence  that  matches  their  number?  The  result  is  [25(p.87),26(p.409)](m) 

=  (0  <t<n).  (3.32) 

t‘ j= 0  ]■ 

The  moments  of  this  distribution  are:  p  =  1,  cj  =  1,  TI3  =  1,  t|4  =  4.  This  distribution  is  the 
finite-domain  equivalent  of  the  Poisson  distribution,  in  the  sense  that  the  first  n  factori¬ 
al  moments  of  the  two  distributions  are  equal  [26(p,410)], 

(m)  The  equation  in  Ref.  26  (Eqn  10.19)  contains  a  misprint  in  the  lower  limit  of  the  sum. 
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Figure  3.7  shows  the  classical  matching  distribution  with  n  =  10  (left-most  bar  in 
each  group).  The  values  of  pcm(0)  and  pcm(l)  are  almost  equal,  after  which  pcm  values 
fall  rapidly  with  increasing  t,  as  the  inset  indicates.  The  last  values  are  pcm(n  -  1)  =  0 
311(1  Pcm(w)  =  1M 

Naor's  distribution  [26(p.447)]  belongs  to  a  type  known  as  'urn  models'  [24].  It 
gives  the  probability  that  t  attempts  are  required  to  draw  a  red  ball  from  a  urn  that  ini¬ 
tially  contains  one  red  ball  and  n  -  1  white  balls,  under  the  condition  that,  every  time  a 
white  ball  is  drawn,  it  is  replaced  with  a  red  ball.  Clearly,  this  sort  of  problem  admits 
almost  endless  variation.  As  described  above,  the  domain  of  Naor's  distribution  is  1  <  t 
<n.  A  straightforward  change  of  variable  gives 


Pn(*)  = 


n\(t  + 1) 
(n-f)!(n  +  l)f+1 


(0  <  f  <  n) 


(3.33) 


Despite  its  simple  form,  there  are  apparently  no  known  closed-form  expressions  for  the 
moments  of  this  distribution.  Figure  3.7  shows  the  distribution  with  n  =  10  (right-most 
bar  in  each  group).  The  distribution  rises  to  a  maximum  at  n  =  2  and  then  falls,  slowly 
at  first.  The  last  value  is  pN(n)  =  n\/(n  +  l)n. 

A  more  bizarre  example  of  a  parameter  less  distribution  is  'Haight's  harmonic 
distribution',  which  is  [26(p.470)] 


PHh(t)=  — 


2Z 
2f +  1 


2Z 

2f +  3. 


(0<f<[Z-l/2j), 


(3.34) 


where  2Z  is  a  positive  integer  and  [_xj  is  the  largest  integer  less  than  or  equal  to  x.  There 
does  not  seem  to  be  any  particular  application  behind  the  definition  of  this  distribu¬ 
tion.  It  has  the  property  that,  for  large  Z  values,  Pnh(0  is  zero  f°r  considerable  ranges 
of  f,  with  isolated  nonzero  values.  Figure  3.7  shows  something  of  this  behaviour  (mid¬ 
dle  bars  of  each  group),  although  n  =  10  is  not  large  enough  for  it  to  be  fully  developed. 
Expressions  for  the  mean  and  standard  deviation  of  this  distribution  are  available  [26 
(PP -470-1)]. 


classical  matching 
Haight  harmonic,  Z  =  21/2 
Haight  harmonic,  Z  =  11 
Naor 


Figure  3.7:  Four  parameterless  distributions  with  n  =  10.  The  inset  shows  detail  of  the  classical 
matching  distribution  for  6  <t  <10.  See  text  for  fiirther  comments. 
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It  is  not  quite  right  to  describe  Haight's  harmonic  distribution  as  parameterless:  if 
one  wishes  an  upper  limit  of  n,  then  Z  =  n  +  V 2  and  Z  =  n  +  1  are  both  possibilities  and 
give  different  distributions,  as  Figure  3.7  shows.  (In  this  case— n  =  10  — the  most  pro¬ 
nounced  difference  is  that  Pnh(4)  =  0  for  Z  =  11,  but  Pnh(^)  =  0  f°r  2  =  21/2.)  In  this 
sense,  a  parameter  variation,  albeit  highly  restricted,  can  be  performed. 

(b)  One-Parameter  Distributions 

The  'riff-shuffle  distribution'  is  related  to  a  problem  in  combining  two  packs  of  cards. 
Its  definition  is  [26  (p.234)]  (n) 

Pr-s(£)  =  (”  7  f)[«"+1  (1  -  a)f  +«'  a ]  (0<(  <n),  (3.35) 

where  0  <  a  <  1.  This  distribution  is  related  to  the  negative  binomial  distribution.  It  is 
symmetric  about  a  =  0.5  and  tends  to  8f  0  as  a  — »  0  or  as  a  — >  l.(°)  Expressions  for  the 
moments  are  available  in  terms  of  incomplete  (3  functions  [28],  Examples  of  the  distri¬ 
bution  for  n  =  10  are  shown  in  Figure  3.8. 

Dandekar  introduced  several  distributions  related  to  the  binomial  and  Poisson 
distributions,  in  which  the  probability  of  a  success  in  a  trial  is  set  to  zero  for  a  specified 
number  of  trials  following  a  successful  trial.  His  first  modified  Poisson  distribution  has 
one  parameter  only  [25(p.25),26(p.435)]: 

ifapffl - f  EPffCl.  (0<f<LvJ),  (3.36) 

l=o  1 

where  X  and  v  are  both  greater  than  zero  and  [rj  is  the  largest  integer  less  than  or  equal 
to  x.  To  have  any  members  in  the  distribution,  v  >  1.  As  with  Haight's  harmonic  distri¬ 
bution,  some  variation  of  v  is  available  once  the  maximum  range  n  has  been  chosen: 
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IL 


Ml 


10 


Figure  3.8:  Examples  of  the  riff-shuffle  distribution  for  n  =  10.  Calculated  values  of  the  mean 
are  shown  in  the  legend.  Tire  distribution  tends  to  6t  0  as  a  — »  0  or  a  — » 1. 


(n)  The  distribution  is  incorrectly  stated  by  Johnson  et  al.  [26(Eqn  5.92)],  see  Ref.  28. 

(o)  n  is  the  'Kronecker  delta':  8m  n  =  1  if  m  =  n  and  zero  otherwise. 
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n  <  v  <  n  +  1,  although  in  this  case,  the  effect  of  varying  v  over  this  range  is  relatively 
minor.  As  A  —*■  0,  the  distribution  degenerates  to  Sf0.  It  seems  that  no  simple  expres¬ 
sions  for  the  moments  are  known. 

Note  that  Equation  (3.36)  gives  the  cumulative  probability  distribution.  The  usual 
probability  distribution  can  be  calculated  by  taking  successive  differences.  Figure  3.9 
shows  some  cases  for  v  =  10.0.  On  the  basis  of  this  cursory  numerical  inspection,  it 
appears  that  the  distribution  approaches  8f  n  as  A  — >  oo. 

The  Bose-Einstein  distribution  comes  from  quantum  statistical  mechanics.  Of  the 
three  state-occupancy  distributions  of  statistical  mechanics,  it  is  the  only  one  relevant 
here;  the  Maxwell-Boltzmann  distribution  is  equivalent  to  the  binomial  distribution 
and  the  Fermi-Dirac  distribution  is  defined  for  0  <  t  <  1  only.  All  three  distributions  are 
examples  of  urn  models.  Using  notation  consistent  with  that  adopted  in  this  Report, 
the  Bose-Einstein  distribution  of  state  occupancies  can  be  written  [24(p.ll2),26(p.421)] 


Pb-e(0  = 


(N-l)n\(n  +  N-t-2)\ 

(»  +  N-l)!(n-f)! 


(0  <  t  <  n ), 


(3.37) 


where  N  >  2  and  is  integral.  This  is  one  case  where  expressions  for  the  mean  and  vari¬ 
ance  are  simple.  That  for  the  mean,  p  =  n/N,  gives  an  expression  for  N,  which  leads  to 
the  variance  as 

^(k-hXh+D 

n  +  p 

In  statistical  mechanics,  n  is  the  number  of  particles  and  N  the  number  of  states, 
both  of  which  are  always  very  large  in  physics  applications,  but  the  distribution  is  per- 
ectly  well  defined  for  small  values  also.  Figure  3.10  shows  four  examples  with  n  =  10. 

The  Bose-Einstein  distribution  becomes  the  discrete  uniform  distribution  when 
N  =  2;  as  N  — »■  oo,  it  degenerates  to  8f  0. 
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Figure  3.9:  Examples  of  Dandekar's  first  modified  Poisson  distribution  with  v  =  10.  Calculated 
values  of  the  mean  are  shown  in  the  legend.  The  distribution  tends  to  5t  0  as  A  — »  0  and  8t  w  as 
A  — >  oo. 
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Figure  3.10:  Examples  of  the  Bose-Einstein  distribution  for  n  =  10.  The  distribution  equals  the 
discrete  uniform  distribution  xuhen  N  =  2  and  tends  to  5t  0  as  N  — » <x>. 


(c)  Distributions  with  Tivo  or  More  Parameters 

The  Laplace-Haag  matching  distribution  is  a  two-parameter  generalisation  of  the  clas¬ 
sical  matching  distribution  [26 (p. 410)]: 


Pl-h(*)  = 


n! 

Nltt 


y(j nUzlzHgt+l 

p0  j  <(n-t-j)l 


(0  <t<n), 


(3.39) 


where  a  >  0  and  N  >  max  ( n ,  na).  In  the  original  Laplace  version  of  the  distribution,  a  is 
integral  and  N  =  na.  However,  the  distribution  is  well  defined  for  any  positive  a,  in¬ 
cluding  a  <  1. 

A  general  expression  for  the  factorial  moments  of  the  Laplace-Haag  distribution 
is  known  [26(p,410)].  This  leads  to  expressions  for  the  mean  and  variance  that  can  be 
readily  inverted: 

p(p2  +  o2-p) 

a  =  5 N=na/p.  (3.40) 

naz  +  p  -  np 

The  condition  N  >  na  on  parameter  N  means  that  p  <  1,  regardless  of  the  value  of  n. 

The  'binomial  distribution  of  order  V  is  an  example  of  a  class  of  distributions  that 
has  been  much  studied  in  the  last  two  decades.  It  is  [26(p.431)] 


1  >*1+-+** 
--1 

xV  J 


j~u  xi  xk 

where  0<  p  <  1, 

(x1  +".  +  Xic+t\_{t+Hl=lXl)' 


0<t< 


*l . xk't  J 


is  the  multinomial  symbol  and  the  inner  summations  in  Equation  (3.41)  run  over  all 
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non-negative  subscript  values  that  satisfy 

k 

'£lxi=n-j-kt.  (3.43) 

1=1 


This  is  effectively  a  2-parameter  distribution.  Despite  the  complication  of  its  definition, 
relatively  simple  expressions  for  its  mean  and  variance  are  known  [26  (p .431)],  but  they 
cannot  be  readily  solved  in  favour  of  p  and  k. 

The  'specified  occupancy  distribution'  [26(p.416)]  is  a  two-parameter  generalisa¬ 
tion  of  the  classical  occupancy  distribution: 

(0^4  (3.44) 


where  b  >  0  and  c  >  n  are  both  integers.  Once  again,  expressions  for  the  moments  are 
available  but  they  cannot  be  inverted  in  closed  form. 

As  a  final  example,  the  'multinomial  distribution',  a  generalisation  of  the  binomi¬ 
al  distribution,  has  as  many  parameters  as  one  desires.  Its  definition  is  [25(p.l04),26(p. 
460)] 


(0  <  t  <  ns). 


(3.45) 


where  the  pj,  each  separately  satisfying  0  <  p  •  <  1,  also  satisfy  the  third  condition  below 
and  the  summations  run  over  all  non-negative  integers  that  satisfy  the  first  and  second 
following  conditions: 

s  s  s 

2>/=1-  (3-46) 

7=0  7=0  7=0 

As  a  consequence  of  the  last  condition,  only  s  of  the  s  +  1  values  of  pj  are  independent, 
but  s  itself  can  be  considered  a  parameter,  giving  s  +  1  parameters  in  all.  The  case  s  =  1 
is  the  binomial  distribution.  Quite  simple  expressions  for  the  moments  are  known,  but 
the  task  of  inverting  s  +  1  equations  is  substantial. 


4.  Conclusion 

This  report  collects,  summarises  and  extends  the  knowledge  of  the  properties  of  skew¬ 
ed  probability  distributions,  with  the  chief  goal  of  providing  a  means  of  comparing 
among  them.  The  motivation  for  this  was  drawn  from  various  studies  in  operational 
analysis  in  which  random  variables  are  employed  without  any  firm  information  on 
their  probability  distributions.  In  this  situation,  it  is  of  interest  to  look  for  sensitivity  to 
the  distribution  chosen.  This  can  best  be  done  by  running  the  model  with  a  variety  of 
distributions,  which  then  raises  the  question  of  how  to  compare  them.  The  work  began 
with  the  premise  that  a  viable  method  for  such  comparison  is  to  match  distribution 
moments,  but  this  requires  access  to  the  requisite  equations.  Surprisingly,  these  are 
almost  completely  absent  from  compilations  of  information  on  probability  distribu¬ 
tions.  Hence,  the  main  task  addressed  in  this  report— and  addressed  successfully— is 
the  derivation  and  compilation  of  the  required  equations  for  common  skewed  distri¬ 
butions.  A  total  of  18  distributions  are  treated  in  detail,  both  finite  and  semi-infinite. 
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discrete  and  continuous.  These  include  three  non-skewed  distributions,  for  compari¬ 
son.  An  additional  11  discrete  distributions  are  briefly  mentioned. 

In  the  process,  it  is  noted  that  most  distributions  have  a  limited  accessible  range 
of  the  coefficient  of  variation  (the  ratio  af  p  of  standard  deviation  to  mean).  A  signi¬ 
ficant  part  of  this  work  is  the  determination  of  these  limits.  The  results  are  presented 
above  for  each  distribution;  they  are  also  collected  in  Table  2.5,  Table  3.1  and  Figure 
3.4.  The  limits  typically  place  limits  on  accessible  values  of  the  higher  moments,  which 
are  the  main  indicators  of  difference  between  distributions  with  equal  mean  and  stan¬ 
dard  deviation. 

The  distributions  treated  in  this  report  comprise  all  of  the  most  commonly  used 
skewed  distributions.  The  collected  formulae  and  graphs  should  be  of  use  to  anyone 
using  skewed  random  variates  who  needs  a  common  and  consistent  basis  for  compar¬ 
ing  one  distribution  with  another. 
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